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DNA POLYMERASE HAVING ABILITY TO REDUCE INNATE 
SELECTIVE DISCRIMINATION AGAINST 
FLUORESCENT DYE - LABELED D I D E OX YNUCLE OT IDES 

BACKGROUND OF THE INVENTION 
The genetic material of all known living 
organisms is deoxyribonucleic acid (DNA) , except 
in certain viruses whose genetic material may be 
ribonucleic acid (RNA) . DNA consists of a chain of 
individual deoxynucleo tides chemically linked in 
specific sequences. Each deoxynucleotide contains 
one of the four nitrogenous bases which may be 
adenine (A) , cytosine (C) , guanine (G) or thymine 
(T) , and a deoxyribose, which is a pentose, with a 
hydroxyl group attached to its 3' position and a 
phosphate group attached to its 5' position. The 
contiguous deoxynucleo tides that form the DNA 
chain are connected to each other by a 
phosphodiester bond linking the 5' position of one 
pentose ring to the 3' position of the next 
pentose ring in such a manner that the beginning 
of the DNA molecule always has a phosphate group 
attached to the 5' carbon of a deoxyribose . The 
end of the DNA molecule always has an OH 
(hydroxyl) group on the 3' carbon of a 
deoxyribose , 

DNA usually exists as a double -stranded 
molecule in which two antiparallel DNA strands are 
held together by hydrogen bonds between the bases 
of the individual nucleotides of the two DNA 
strands in a strictly matched "A-T " and "C-G" 
pairing manner. It is the order or sequence of the 
bases in a strand of DNA that determines a gene 
which in turn determines the type of protein to be 
synthesized. Therefore, the accurate determination 
of the sequence of the bases in a DNA strand which 
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also constitutes the genetic code for a protein is 
of fundamental importance in understanding the 
characteristics of the protein concerned. 

. . The process used to determine the 

sequence of the bases in a DNA molecule is 
referred to as DNA sequencing. Among the 
techniques of DNA sequencing, the enzymatic method 
developed by Sanger et al . (1) is most popular. It 
is based on the ability of a DNA polymerase to 
extend a primer annealed to the DNA template to be 
sequenced in the presence of four normal 
deoxynucleotide triphosphates (dNTPs) , namely, 
dATP, dCTP, dGTP and dTTP, and on the ability of 
the nucleotide analogs, the dideoxynucleotide 
triphosphates (ddNTPs) , namely, ddATP, ddCTP, 
ddGTP and ddTTP, to terminate the extension of the 
elongating deoxynucleotide polymers at various 
lengths . 

In the classic one-step Sanger method, 
the sequence determination is carried out in a set 
of four separate tubes, each containing all four 
normal dNTPs , one of which is labeled with a 
radioactive isotope, 32 P or 35 S, for 
autoradiographic localization, a limiting amount 
of one of the four ddNTPs, a DNA polymerase, a 
primer, and the DNA template to be sequenced. As a 
result of the DNA polymerase activity, individual 
nucleotides or nucleotide analogs are added to the 
new DNA chains, all starting from the 3' end of 
the primer in a 5' -3' direction, and each linked 
to adjacent ones with a phosphodiester bond in a 
base sequence complementary to the DNA sequence of 
the template. Inasmuch as there is a nucleotide 
analog in the reaction mixture, each tube 
eventually contains numerous newly formed DNA 
strands of various lengths, all ending in a 
particular ddNTP, referred to as A, C, G or T 
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terminator . 

After resolving the four sets of 
reaction products by high-resolution 
polyacrylamide/urea gel electrophoresis, the 
populations of the newly formed DNA strands are 
separated and grouped according to their molecular 
weight. An autoradiographic image of the gel will 
show the relative positions of these DNA strands 
as bands which differ from one another in distance 
measured by one nucleotide in length, all sharing 
an identical primer and terminating with a 
particular ddNTP (A, C ( G or T) . By reading the 
relative positions of these bands in the "ladder" 
of the autoradiography the DNA sequence of the 
template can be deduced. 

The DNA polymerase used in the reaction 
mixture plays a pivotal role in DNA sequencing 
analysis. To be useful for DNA sequencing, a DNA 
polymerase must possess certain essential 
properties. For example, it must have its natural 
5' -3' exonuclease activity removed by mutagenesis 
or by posttranslational modification, such as 
enzymatic digestion, and must be able to 
incorporate dNTPs and ddNTPs, without undue 
discrimination against ddNTP and with a 
sufficiently high processivity which refers to the 
ability of the enzyme to polymerize nucleotides 
onto a DNA chain continuously without being 
dislodged from the chain, and a sufficiently high 
elongation rate. A 5' -3' exonuclease activity 
associated with a DNA polymerase will remove 
nucleotides from the primer, thus cause a 
heterogeneous 5' end for the newly formed DNA 
strands, resulting in a false reading of the 
strand lengths on the sequencing gel. A DNA 
polymerase with a low processivity and a low 
elongation rate will cause many undesirable noise 
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background bands of radioactivity due to the 
presence of DNA strands which are formed with 
improper lengths and improper terminations. Among 
the more commonly used DNA polymerases, Sequenase™ 
5 has a higher processivity and a higher elongation 

rate than others, such as the Klenow fragment, 
Taq, and Vent polymerases (2), and is therefore 
one of the most popular DNA polymerase selected 
for DNA sequencing to-date. 
10 However, even when a DNA polymerase has 

been endowed with all the essential properties 
listed above, it may still generate erroneous or 
misleading band patterns of radioactivity in the 
sequencing gel. These artif actual patterns do not 
15 faithfully reflect the true nucleotide sequence in 

the template being sequenced. They may be caused 
by premature termination of the elongating strands 
due to the presence of secondary structures formed 
along the template, such as « hairpins " in the 
20 regions that contain palindromic sequences or that 

are rich in G and C bases (3) ; or, they may occur 
as a result of inadequate " proof-reading " 
function of the DNA polymerase that will allow the 
removal of misincorporated nucleotides at the 3' 
25 end of an elongating strand. 

Researchers in the field of DNA 
sequencing often have to use several approaches to 
confirm their findings in order to avoid being 
misled by these potentially erroneous sequence 
30 data. For example, they sometimes rely on 

repeating the same sequencing experiment with 
different DNA polymerases, or performing another 
sequencing reaction with the template which is 
complementary to the first single-stranded DNA 
3 5 template, and compare the results for possible 

discrepancies . 

Numerous investigators have tried to 
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find an ideal DNA polymerase for enzymatic 
sequencing, i.e. an enzyme that not only has all 
the essential properties required for sequencing 
reaction, but also is capable of resolving the 
5 secondary hairpin structures and preventing the 

formation of strands containing nucleotides non- 
complementary to those of the template being 
sequenced . 

The discovery by Ye and Hong (4) of the 
10 thermostable large fragment of DNA polymerase 

isolated from Bacillus stearo thermophilics (Bst) , 
an enzyme that is functional over the temperature 
range between 25°C and 75°C, but is most active at 
65°C, and possesses all the essential properties 
15 for DNA sequencing, has largely solved the problem 

caused by secondary structures in the template 
since these secondary structures are destabilized 
when the sequencing reaction is carried out at 
65°C. In the past few years since this enzyme was 
20 made commercially available under the name of Bst 

DNA Polymerase {Bio-Rad Laboratories), independent 
reports have confirmed that during sequencing 
reaction catalyzed by this enzyme all four dNTPs, 
including dCTP, and other nucleotide analogs, such 
25 as dITP and 7-deaza-dGTP , are incorporated equally 

effectively in the chain elongation, thus 
eliminating the weak band phenomena often 

observed when other DNA polymerases are used, and 
producing a very good band uniformity on the 
30 sequencing gel. It has been further established 

that at this elevated temperature Bst DNA 
polymerase system can be used both for the classic 
Sanger one -step reaction as well as for the 
"labeling/termination" sequencing reaction, 
35 double-stranded DNA sequencing, and the 

incorporation of 35 S-labeled nucleotides, and 32 P- 
labeled nucleotides. Since this system can be 
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placed at room temperature for at least two weeks 
without significant loss of its enzymatic 
activity, it has been adapted for automation of 
DNA sequencing which requires a stable DNA 
5 polymerase, using either fluorescent dye or 

radioactive isotope labeling. (See also 9, 12, and 
13.) 

However, when this Bst enzyme is used 
for automated fluorescent DNA sequencing, only 

10 partially satisfactory results have been obtained 

with fluorescent dye-labeled primers (see 12 and 
EG Bulletin 1771 of Bio-Rad Laboratories) , and 
even less satisfactory results are obtained with 
fluorescent dye -labeled ddNTP terminators. Even 

15 when fluorescent dye- labeled primers are used, a 

significant number of mismatched ddNTPs are 
incorporated onto the 3' end of the extending 
nucleotides in the enzymatic reaction, thus 
generating erroneous sequencing data (see Bio-Rad 

20 EG Bulletin 1771) . With this in mind, the 

inventors sought, and found, a better DNA 
polymerase for DNA sequencing, especially for 
automated fluorescent dye- labeled primer and 
fluorescent dye- labeled terminator sequencing. 

25 Another disadvantage of the Bst DNA 

polymerase currently known in the art is its lack 
of 3' -5' exonuclease activity (5), and 
specifically, proof-reading 3' -5' exonuclease 
activity. A survey of the sequencing data 

30 collected from fourteen research centers which 

have used this Bst DNA polymerase for their DNA 
sequencing work on over 120 DNA clones showed 
that, statistically, base pair mismatching occurs 
at a rate of about 1.5 x 10" 5 . That is, 

35 approximately 1.5 errors can be expected in one 

hundred thousand nucleotide incorporations during 
nucleotide polymerization catalyzed by the enzyme. 



6 



BNSDOCID; <WO 0017330A1_L> 



PCT/1B99/00146 

WO 00/17330 

It is generally known that the formation 
of incorrect DNA sequences due to mismatching of 
base pairs between the template and the growing 
nucleotide chain in DNA sequencing may be 

5 prevented by a 3' -5' exonuclease activity which 

"proof-reads" the nucleotide chain. However, even 
if a DNA polymerase exhibits 3' -5' exonuclease 
activity in vitro, it is often the case that the 
polymerase will not adequately "proof-read". 

10 Thus, the polymerase will not be capable of 

removing mismatched nucleotides from a newly 
formed DNA strand as efficiently as those 
nucleotides correctly matched with the nucleotides 
of the template. In other words, a 3' -5' 

15 exonuclease may excise the correctly matched 

nucleotides at a faster rate than the mismatched 
ones from the 3' terminus, or excise both the 
correctly matched and the mismatched nucleotides 
at the same rate. Consequently, even where the 

20 DNA polymerase has 3' -5' exonuclease activity, it 

does not perform any useful proof-reading function 
during DNA polymerization. 

It is also known that a 3' -5' 
exonuclease activity associated with a DNA 

2 5 polymerase, in the presence of low concentrations 

of dNTPs, often counteracts the normal chain 
elongation process catalyzed by the polymerase, 
induces cyclic incorporation and degradation of 
nucleotides over the same segment of template, or 
30 even operates more efficiently than the polymerase 

activity per se, to the extent of causing 
degradation of the primer. Consequently, removal 
of the 3' -5' exonuclease activity along with the 
5' -3' exonuclease activity from the native DNA 

3 5 polymerases by chemical means or by genetic 

engineering techniques has become a standard 
procedure in producing DNA polymerases for 
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sequencing. This is a common strategy to preserve 
the essential properties of a DNA polymerase. 

For example, among the major 
commercially available sequencing enzymes (other 
than the native Taq (Thermus aquaticus) DNA 
polymerase which lacks a 3' -5' exonuclease 
activity de novo) the 3' -5' exonuclease activity 
has been removed from the native T7 DNA 
polymerase, which lacks a 5' -3' exonuclease, 
either by a chemical reaction that oxidizes the 
amino acid residues essential for the exonuclease 
activity (Sequenase™ Version 1) or genetically by 
deleting 28 amino acids essential for the 3' -5' 
exonuclease activity (Sequenase™ 2) . 

Vent R (exo~) DNA polymerase, which is 
recommended as the preferred form of the Vent DNA 
polymerase for sequencing, also has its 3' -5' 
exonuclease activity removed by genetic 
modification. The native Vent DNA polymerase and 
the Klenow fragment isolated from the native E^ 
coli DNA polymerase I possess a 3' -5' exonuclease; 
but these enzymes are no longer considered the 
enzymes of choice for DNA sequencing. 

The currently known Bst DNA polymerase 
(e.g., produced by Bio-Rad Laboratories) isolated 
and purified from the cells of Bacillus 
stearothermophilus for DNA sequencing is free of 
3' -5' exonuclease activity (5). 

IsoTherm™ DNA Polymerase, a commercially 
available Bst DNA polymerase for DNA sequencing, 
marketed by Epicentre Technologies (1402 Emil 
Street, Madison, WI 53713) , is also based on a Bst 
DNA polymerase whose 3' -5' exonuclease activity 
has been enzymatically removed (6) . 

Only the rBst DNA Polymerase produced 
from an over- expressing recombinant clone in Ej_ 
coli , which is the product of the DNA pol I gene 
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of Bacillus stearothermophilus, possesses a 3' -5' 
exonuclease activity in addition to a 5' -3' 
exonuclease activity. However, due to the 
existence of an undesirable 5' -3' exonuclease 
activity and a 3' -5' exonuclease activity of 
unknown characteristics, the latter product is not 
recommended by the company for DNA sequencing (6) . 

Over the past 10 years there has been a 
trend to develop and improve the automated 
fluorescent DNA sequencing technology to replace 
the classic radioactive isotope labeling manual 
method for DNA sequencing because of the potential 
harmful effects of the radioactive materials to 
humans and because of the need for automated high 
throughput DNA sequencing systems. In using 
fluorescent dyes as markers for labeling the DNA 
strands generated in enzymatic reactions for 
sequencing, the dyes can be either coupled with 
the primer, or coupled with the ddNTP terminators, 
namely the dye-labeled ddATP, dye-labeled ddCTP, 
dye-labeled ddGTP and dye-labeled ddTTP. 
Sequencing techniques based on these two forms of 
labeling of the final enzymatic reaction products 
are commonly referred to as "dye primer 
sequencing" and "dye terminator sequencing", 
respectively. 

In the dye primer sequencing, ddNTPs are 
employed as the chain terminators, as in the 
original classic Sanger method which uses 
radioactive isotope as the marker. The molecular 
structure of ddNTPs are almost identical to that 
of dNTPs, the natural building blocks of all DNA 
molecules. Therefore, any DNA polymerase which 
has been used for radioactive isotope manual DNA 
sequencing can be easily adapted for fluorescent 
dye primer DNA sequencing with equally 
satisfactory results. The disadvantage in the dye 
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primer technology is that the primer for each 
template to be sequenced must be labeled with four 
different fluorescent dyes and that the enzymatic 
reaction must be performed in four separate test 
tubes each containing only one of the ddNTPs, 
namely ddATP, ddCTP, ddGTP or ddTTP, as in the 
classic Sanger radioisotope method. 

In the dye terminator technology for DNA 
sequencing, the fluorescent dye-labeled ddATP, 
dye -labeled ddCTP, dye -labeled ddGTP and dye- 
labeled ddTTP are coupled with different 
fluorescent dyes, each emitting a specific light 
spectrum, thus directly reporting the type of 
ddNTP at the 3' terminus of the DNA fragment. 
Unlike the situations in the dye primer technology 
in which four different fluorescent dyes are 
coupled to a primer incorporated into all newly 
formed DNA strands, these dye- labeled ddNTPs serve 
the dual function of a specific base terminator 
and a "color marker" . There is no need to label 
the primer for each new template, and the 
polymerase DNA extension reaction can be performed 
in a single test tube to generate the required 
specifically terminated and specifically dye- 
labeled DNA fragments of various sizes for DNA 
sequencing . 

The advantage of using fluorescent dye- 
labeled terminators for DNA sequencing is obvious. 
However, there are certain difficulties to 
overcome before an enzymatic reaction system 
suitable for a radioisotope technique or suitable 
for a dye primer technique can be adapted for a 
dye terminator technology. An increase of the 
molecular weight from less than 500 for a ddNTP 
terminator to about 800 or more for a fluorescent 
dye-labeled ddNTP terminator may be associated 
with potential three-dimensional structural 
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changes. These molecular alterations may 
interfere with the process of incorporation of the 
dye- labeled ddNTPs as chain terminators by the DNA 
polymerase to the 3' end of an extending DNA 
5 strand in terms of lowering the rate of 

incorporation, lowering the processivity of the 
enzyme for this new substrate, reducing the 
enzyme- terminator binding specificity and changing 
the enzyme -terminator binding kinetics. 
10 For example, both Taq DNA polymerase and 

Sequenase II™ (a T7 DNA polymerase) have been used 
for radioisotope labeling DNA sequencing with 
excellent results, and have been adapted for 
fluorescent dye-labeled primer DNA sequencing. 
15 But neither can be used for fluorescent dye- 

labeled terminator DNA sequencing technologies. 
As reported in U.S. Patent 5,614,365, when the Taq 
DNA polymerase was used for fluorescent dye- 
labeled terminator chemical reactions, the 
20 reaction products generated no readable data on 

the DNA sequencer. Most of the fluorescence was 
either in unincorporated dye-ddNTPs at the leading 
front of the test gel, or in fragments greater 
than several hundred bases in length. Using a Taq 
25 DNA polymerase mutant in which the amino acid, 

phenylalanine, at position 667 of its amino acid 
sequence has been replaced by a tyrosine and which 
has an increased ability to incorporate 
dideoxynucleotides (6,000 times more efficient), 
3 0 to replace the unmodified Taq DNA polymerase for 

the experiment, the results are significantly 
improved. This F667Y mutant of Taq DNA polymerase 
is now marketed by Amersham Life Science, Inc. 
under the trademark ThermoSequenase™. It is used 
35 for cycle -sequencing in which the enzymatic 

reaction mixture is subjected to numerous cycles 
of extension- termination, denaturing and annealing 
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to ensure that sufficient dye-terminator-labeled 
enzymatic reaction products are generated for the 
DNA sequencing procedure . Because of the low 
processivity of the parent Taq DNA polymerase, 
ThermoSequenase™ is not recommended for direct DNA 
sequencing without precyclings. Like Taq DNA 
polymerase, ThermoSequenase™ lacks a proof-reading 
exonuclease activity. 

Bacillus stearothermophilus , Bacillus 
caldotenax and Bacillus caldolyticus are 
classified as mesophilic microbes; although their 
DNA polymerases are referred to as thermostable 
(most active at 65 *C) they are inactivated at 70 "C 
or above. This is contrasted with other enzymes, 
such as Taq, which are truly thermophilic-- that 
is, their DNA polymerases tolerate and remain 
active at temperatures higher than 95 *C. These 
mesophilic bacillus strains, especially Bacillus 
stearothermophilus , produce DNA polymerases that 
are useful in DNA sequencing applications. 
However, a disadvantage of the DNA polymerases of 
these strains is that during DNA sequencing they 
all exhibit a high degree of selective 
discrimination against incorporation of certain 
particular members of fluorescent dye -labeled 
ddNTPs, namely the fluorescent dye -labeled ddCTP 
and fluorescent dye -labeled ddATP, as terminators 
onto the 3' end of the extending DNA fragments 
during enzymatic reaction. This peculiar 
characteristic of selective discrimination against 
incorporation of fluorescent dye- labeled ddCTP and 
ddATP of the natural DNA polymerases isolated from 
Bacillus stearothermophilus and Bacillus 
caldotenax was not previously recognized. Such 
selective discrimination is apparently sequence- 
related, and cannot be corrected or compensated by 
mere adjustment of the concentrations of the 
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dNTPs . 

Thus, there is a need for a mesophilic 
bacillus DNA polymerase that does not selectively 
discriminate against incorporation of fluorescent 
dye- labeled ddCTP and ddATP, during dye terminator 
DNA sequencing. 

SUMMARY OF THE INVENTION 
This invention addresses the above - 
described problems associated with mesophilic 
bacillus DNA polymerases by providing novel DNA 
polymerases which, during direct DNA sequencing, 
reduce the innate selective discrimination against 
the incorporation of fluorescent dye -labeled ddCTP 
and fluorescent dye- labeled ddATP, without 
increasing the rate of incorporation of the other 
two dye -labeled ddNTP terminators (ddTTP and 
ddGTP) excessively. In particular, this invention 
provides a novel genetic modification of the amino 
acid sequence of a highly processive DNA 
polymerase (such as isolated from Bacillus 
stearothermophilus , Bacillus caldotenax or 
Bacillus caldolyticus) that, unmodified, 
selectively discriminates against incorporation of 
fluorescent dye- labeled dideoxynucleot ide 
terminators ddATP and ddCTP (but does not 
discriminate against incorporation of fluorescent 
dye-labeled dideoxynucleot ide terminators ddTTP 
and ddGTP) . The modification results in a 
reduction of the innate selective discrimination 
against incorporation of fluorescent dye-labeled 
dideoxynucleot ide terminators ddATP and ddCTP, 
such that all four of the ddNTP terminators are 
effectively incorporated into the DNA primer 
elongated by the DNA polymerase. Thus, the 
modified DNA polymerase of this invention is 
effective in reducing the innate selective 
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discrimination against incorporation of 
fluorescent dye -labeled dideoxynucleotide 
terminators ddATP and ddCTP characteristic of the 
DNA polymerase in its unmodified state. 

In particular, the preferred DNA 
polymerase is a modification of a DNA polymerase 
isolated from a strain of a mesophilic bacterium, 
such as Bacillus stearo thermophilics , Bacillus 
caldotenax or Bacillus caldolyticus . The approach 
of modifying the DNA polymerase described herein 
may be used to modify other DNA polymerases which 
share a close amino acid homology of a DNA 
polymerase isolated from a strain Bacillus 
stearothermophilus , Bacillus caldotenax or 
Bacillus caldolyticus, as long as the unmodified 
DNA polymerases have a selective discrimination 
against incorporation of fluorescent dye -labeled 
dideoxynucleotide ddCTP and/or ddATP as 
terminators in the enzymatic reaction for 
preparing materials for automated fluroescent DNA 
sequencing. Consequently, it is preferred that 
the modified DNA polymerase has an amino acid 
sequence that shares not less than 95% homology of 
a DNA polymerase isolated from a strain of 
Bacillus stearothermophilus, Bacillus caldotenax 
or Bacillus caldolyticus. 

The particularly preferred mesophilic 
species is Bacillus stearothermophilus , which is 
highly heterogeneous. This is indicated by the 
wide range of DNA base compositions as well as the 
range of the phenotypic properties of strains 
assigned to this species (see Bergey's Manual of 
Systemic Bacteriology, Eds. P.H.A. Sneath, N.S. 
Mair, M.E. Sharpe and J.G. Holt, Williams & 
Wilkins, 1986, Vol. 2, page 1135). Therefore, it 
is reasonable to assume that the amino acid 
sequences of DNA polymerases isolated from various 
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strains would be heterogeneous with potential 
functional differences. Although DNA polymerases 
isolated from the known standard strains of 
Bacillus stearothermophilus have been shown to 
lack a 3' -5' exonuclease activity, a questionable 
trace of "contaminating" 3' 5' exonuclease has been 
observed in a purified DNA polymerase preparations 
(see Kaboev et al . , J. Bacteriology, Vol. 14 5, 
page 21-26, 1981) . 

Consequently, the inventors began to 
address the above- identified problems in the art 
by discovering a strain of Bacillus 
stearothermophilus {designated strain No. 32 0 for 
identification purposes; described in U.S. Patent 
5,747,298) that produces a DNA polymerase 
(designated Bst 320) with a proof-reading 3' -5' 
exonuclease activity which is absent in DNA 
polymerases isolated from other strains of 
Bacillus stearothermophilus. (For this invention, 
the term "proof-reading" is intended to denote 
that the DNA polymerase is capable of removing 
mismatched nucleotides from the 3' terminus of a 
newly formed DNA strand at a faster rate than the 
rate at which nucleotides correctly matched with 
the nucleotides of the template are removed during 
DNA sequencing.) The strain Bst 320 was deposited 
on October 30, 1995 in the American Type Culture 
Collection, located at 12301 Parklawn Drive, 
Rockville, Maryland 20852, and has been given ATCC 
Designation No. 55719. The DNA polymerase 
isolated from Bst 320 is composed of 587 amino 
acids as are the DNA polymerases of other known 
strains of Bacillus stearothermophilus, such as, 
for instance, the strains deposited by Riggs et al 
(Genbank Accession No. L42111) and by Phang et al . 
(Genbank Accession No. U23149) . However, the Bst 
320 shares only 89.1% sequence identity at protein 
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level with the Bacillus stearothermophilus DNA 
polymerase deposited by Riggs et al . , and shares 
only 87.4% sequence identity at protein level with 
the Bacillus stearothermophilus DNA polymerase 
deposited by Phang et al . For comparison, the 
above -referenced enzyme deposited by Riggs et al . 
and the enzyme deposited by Phang et al . share 
96.9% of their amino acid sequence identity. 

The inventors studied a thermostable DNA 
polymerase isolated from a different species, 
Bacillus caldotenax (Bca) , which also has an 
optimum active temperature at 65°C. The inventors 
discovered that the Bst 320 DNA polymerase shares 
88.4% of the amino acid sequence identity with Bca 
DNA polymerase (Uemori et al . J. Biochem. 113: 
401-410, 1993). Based on homology of the amino 
acid sequences, Bst 32 0 DNA polymerase is as close 
to DNA polymerases isolated from Bacillus 
stearothermophilus as to the DNA polymerase 
isolated from Bacillus caldotenax, i.e. another 
species of bacillus. It was also discovered that 
both Bst 320 DNA polymerase and Bca DNA polymerase 
functionally exhibit 3' -5' exonuclease activity, 
which is not associated with known amino acid 
sequence exonuclease motifs I, II and III as in 
the E. coli DNA polymerase I model, or other known 
Bacillus stearothermophilus polymerases. 

The inventors has studied the DNA 
polymerases of three different strains of Bacillus 
stearothermophilus (including DNA polymerase 
obtained from Bst 320) and the DNA polymerase of 
Bacillus caldotenax and found that they all 
exhibit a high degree of selective discrimination 
against incorporation of certain particular 
members of fluorescent dye -labeled ddNTPs, namely 
the fluorescent dye-labeled ddCTP and fluorescent 
dye-labeled ddATP, as terminators onto the 3' end 
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of the extending DNA fragments during enzymatic 
reaction. This is especially the case when the 
preceding 3 ' end base of the extending DNA 
fragment is a dGMP (G) or a dAMP (A) . (By "dNTP" 
5 it is intended to denote the four commonly known 

deoxynucleotide triphosphates, dATP, dTTP, dCTP, 
and dGTP . ) 

This selective discrimination causes 
missing peaks and ambiguous peaks on a color plot 
10 generated by the automated fluorescent DNA 

sequencer, and causes loss of sequencing data and 
erroneous base callings. This is shown in Figures 
6 and 8 . 

This disadvantage of the natural 
15 bacillus DNA polymerases in fluorescent dye- 

labeled terminator DNA sequencing cannot be 
corrected or compensated by mere adjustment of the 
concentrations of the dNTPs and the fluorescent 
dye -labeled ddNTPs in the reaction mixture. This 
20 selective discrimination against the specific dye- 

labeled ddNTPs is also sequence -related as 
demonstrated with respect to Bst in Figures 6 and 
8, in which the missing or ambiguous "C" peaks and 
"A" peaks tend to occur immediately following a 

2 5 preceding "G" peak or a preceding "A" peak. Of 

particular interest is the fact that the "C H and 
"A" peaks immediately following a preceding "C" or 
a preceding "T" peak are quite strong and 
resolvable in the same color plot analysis, 

3 0 indicating that the concentrations of dNTPs and 

the fluorescent dye -labeled ddCTP and the 
fluorescent dye -labeled ddATP were adequate for 
the termination reaction. 

According to the structural model 
3 5 studies carried out on E. coli DNA polymerase I 

{Klenow fragment) , certain amino acids in a 
particular region or regions of a DNA polymerase 
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appear to play important roles in dNTP and ddNTP 
bindings and their final incorporation, and affect 
discrimination between deoxy and dideoxynucleotide 
substrates. For example, mutation of the amino 
acids arginine, asparagine, lysine, tyrosine, 
phenylalanine, aspartate, and glutamate in certain 
locations of amino acid sequences of Klenow 
fragment may affect the binding of dNTP and 
discrimination between deoxy and 

dideoxynucleotides . (See: Joyce, CM., Current 
Opinion in Structural Biology, 1:123-129, 1991. 
Joyce and Steitz, Annu. Rev. Biochem. , 63:777-822, 
1993, page 800. Carrol et al . , Biochemistry 
30:804-813, 1991) . 

The problem which faced the inventors 
was how to reduce the selective discrimination 
against the incorporation of fluorescent dye- 
labeled ddCTP and fluorescent dye- labeled ddATP by 
site -directed mutagenesis of a DNA polymerase, 
without increasing the rate of incorporation of 
the other two dye-labeled ddNTP terminators 
excessively. In particular, the new mutant must 
be able to incorporate more correctly base-matched 
dye- labeled ddCTP and/ or dye- labeled ddATP 
terminators to the dGMP (G) and dAMP (A) bases, 
than to the dCMP (C) and dTMP (T) bases of the 
extending DNA fragments during enzymatic reaction. 
A blanket increase in the ability of an enzyme to 
incorporate all four dye- labeled ddNTPs to the 
same proportion would serve no useful purpose for 
the group of DNA polymerases isolated from 
mesophilic bacilli since, unlike the Taq DNA 
polymerase, the unmodified natural enzymes of 
Bacillus stearo thermophilics and Bacillus 
caldotenax already possess a high ability to 
incorporate fluorescent dye- labeled ddGTP and 
fluorescent dye -labeled ddTTP, and even the 
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fluorescent dye- labeled ddCTP and dye- labeled 
ddATP provided at the immediately preceding base 
at the 3 ' end of the extending DNA fragment is not 
a "G". or an "A". 

The inventors found that DNA polymerases 
isolated from strains of Bacillus 

stearo thermophilics and Bacillus caldotenax possess 
the same amino acids at certain specific positions 
in their amino acid sequence. For example, they 
all have leucine-glutamate-glutamate at positions 
corresponding to positions 342-344 and 
phenylalanine at a position 

corresponding to position 422 of the amino acid 
sequence of the DNA polymerase isolated from No 
320 strain of Bacillus stearothermophilus . The 
inventors further discovered that the most optimal 
modification to solve the problem of selective 
discrimination in direct fluorescent DNA 
sequencing for these DNA polymerases is to modify 
the four amino acids of the natural DNA 
polymerases referenced above in such a form that 
threonine-proline-leucine substitute respectively 
for leucine-glutamate-glutamate at positions 342- 
344 and tyrosine substitutes for phenylalanine at 
position 422 in their amino acid sequences. 
Accordingly, the nucleotide sequence encoding the 
natural forms of the DNA polymerases are modified 
at positions 1024-1032 from CTCGAAGAG to ACCCCACTG 
and at position 1265 from T to A to encode for the 
DNA polymerases having the desired properties. 
The combined effects of these amino acid 
modifications reduce the selective discrimination 
against incorporation of fluorescent dye- labeled 
ddCTP and dye- labeled ddATP of the naturally- 
occurring mesophilic bacillus DNA polymerases 
during enzymatic reaction for direct automated 
fluorescent DNA sequencing. 
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Initially, the DNA polymerases used in 
the inventors' research were obtained by 
overexpression of the genes encoding the 
naturally-occurring enzymes of Bacillus 
5 stearo thermophilics and Bacillus caldotenax. 

Subsequently, modified DNA polymerases obtained by 
overexpression of the site-directed mutated genes 
were used. This invention provides both the 
nucleotide and amino acid sequence for a modified 

10 DNA polymerase to illustrate the practice of this 

new approach of modifying a special group of DNA 
polymerases, as described below. 

In one preferred embodiment, the Bst 320 
DNA polymerase is used for the unmodified, 

15 naturally- occurring DNA polymerase, although DNA 

polymerases isolated from other strains of 
mesophilic bacilli (for instance, Bacillus 
stearothermophilus and Bacillus caldotenax) can be 
used as the starting enzymes for the genetic 

20 modification. As noted above, the Bst 32 0 DNA 

polymerase is also capable of proofreading 3' -5' 
exonuclease activity. In particular, the 
invention provides the DNA and amino acid 
sequences for the isolated and purified DNA 

25 polymerase having this function. These sequences 

are also described below. 

The invention also contemplates an 
isolated strain of Bacillus stearothermophilus 
which produces a DNA polymerase having an ability 

3 0 to reduce selective discrimination against 

incorporation of fluorescent dye- labeled 
dideoxynucleotide terminators ddCTP and ddATP, but 
not fluorescent dye- labeled dideoxynucleotide 
terminators ddGTP and ddTTP, in the presence of 

35 dNTPs and the four fluorescent dye- labeled 

dideoxynucleotide terminators. Preferably, the Bst 
strain produces a DNA polymerase which also has 
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proofreading 3' -5' exonuclease activity during DNA 
sequencing of a DNA strand from a template. 

As mentioned above, the invention also 
contemplates DNA polymerases obtained or otherwise 
derived from any bacillus strain, or made 
synthetically, as long as the amino acid sequences 
of the naturally- occurring DNA polymerases have 
leucine-glutamate-glutamate at positions 
corresponding respectively to positions 342-344 of 
Bst 320 DNA polymerase and phenylalanine at a 
position corresponding to position 422 of Bst 320 
DNA polymerase. For example, DNA polymerases 
derived from other strains of Bacillus 
stearotheimophilus or Bacillus caldotenax or other 
mesophilic bacilli may be easily modified using 
conventional DNA modification techniques to 
include the amino acid or nucleotide substitutions 
identified above. 

The invention also provides a DNA 
construct comprising at least one of the above- 
described DNA polymerase sequences and a vector 
(such as a cloning vector or an expression 
vector) , for introducing the DNA construct into 
eucaryotic or procaryotic host cells (such as an 
E. coli host cell) . In addition, the invention 
further provides a host cell stably transformed 
with the DNA construct in a manner allowing 
production of the peptide encoded by the DNA 
segment in the construct . 

The invention also provides improved 
methods for replicating DNA and sequencing DNA 
using the above -de scribed DNA polymerases of the 
invention. The DNA polymerases are useful in both 
direct dye terminator DNA sequencing and dye- 
primer DNA sequencing. 

Preferably, the method of sequencing a 
DNA strand may comprise the steps of: 
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i) hybridizing a primer to a DNA template to 
be sequenced; 

ii) extending the primer using a DNA 
polymerase which has an ability to reduce 
selective discrimination against incorporation of 
fluorescent dye -labeled dideoxynucleotide 
terminators ddCTP and ddATP, in the presence of 
adequate amounts of nucleotide bases dATP, dGTP, 
dCTP and dTTP, or their analogs, and the four 
fluorescent dye -labeled dideoxynucleotide 
terminators, 

under such conditions that the DNA strand is 
sequenced . 

Further objects and advantages of the 
invention will become apparent from the 
description and examples below. 

BRIEF DESCRIPTION OF THE DRAWINGS 
In the Figures and throughout this 
disclosure, "HiFi Bst" or "Bst 320 M DNA polymerase 
refers to the unmodified naturally occurring DNA 
polymerase having proofreading 3' -5' exonuclease 
activity, either isolated from the cells of No. 
320 strain of Bacillus stearothermophilus or 
produced by overexpression of the gene encoding 
this naturally occurring DNA polymerase. (This Bst 
strain and DNA polymerase are described in U.S. 
Patent 5,747,298.) "HiFi Bst-II" refers to the 
modified form of "HiFi Bst" DNA polymerase which 
has an ability to reduce selective discrimination 
against fluorescent dye- labeled ddCTP and ddATP. 
HiFi Bst-II is an example of one preferred 
embodiment of this invention. 

Figure 1 . This graph shows the 
thermostability at 65 °C of HiFi Bst-II and HiFI 
Bst. 
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Y: relative polymerase activity (%) 
X: incubation time (minutes). 
Figure 2. This shows a autoradiograph of 
a DNA sequencing gel obtained by using 
5 radiolabeled primer with HiFi Bst-II and HiFi Bst, 

and shows the dideoxy- nucleotide incorporation of 
HiFi Bst-II and HiFi Bst in a reaction mixture 
with a suboptimally low ddNTP/dNTP ratios. 

Template: single-stranded M13mpl8; 
10 Primer: -20M13 forward primer. 

Figure 3 . This shows a autoradiograph of 
a DNA sequencing gel obtained by using 
radiolabeled dATP with HiFi Bst and HiFi Bst-II in 
reaction mixtures with optimized ddNTP/dNTP 
15 ratios. The sequence pattern with HiFi Bst-II is 

better than that with HiFi Bst. 

Template: single -stranded M13mpl8,- 
Primer: -20M13 forward primer. 
Figure 4. This shows the results of dye- 
20 primer DNA sequencing with HiFi Bst 

Template: single-stranded pGEM-3Zf ( + ) ; 
Primer: -21M13 forward DYEnamic 
Energy Transfer Dye Primers. 

Figure 5. This shows the results of dye- 
25 primer DNA sequencing with HiFi Bst-II. 

Template: single- stranded M13mpl8; 
Primer: -21M13 forward DYEnamic 
Energy Transfer Dye Primers. 

Figure 6. This shows the results of dye- 
30 terminator DNA sequencing with HiFi Bst 

Template: single- stranded pGEM- 

3Zf <+> ; 

Primer: -2 0M13 forward primer. 
Figure 7. This shows the results of dye- 
35 terminator DNA sequencing with HiFi Bst-II. 

Template: single -stranded M13mpl8; 
Primer: -2 0M13 forward primer. 
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Figure 8. Like Figure 6, this shows the 
results of four fluorescent dye- labeled 
terminators DNA sequencing with HiFi Bst . In 
Figure 8 corrections of the missing or ambiguous 
bases, according to the known pGEM sequence, are 
indicated below the letters "N" or below the 
incorrect base letters. 

Template: single -stranded pGEM- 

3Zf ( + ) ; 

Primer: -2 0M13 forward primer. 

DETAILED DESCRIPTION OF THE INVENTION 

The DNA polymerases of the invention are 
capable of reducing selective discrimination 
against incorporation of fluorescent dye- labeled 
dideoxynucleotide terminators ddCTP and ddATP, 
(but not ddGTP and ddTTP) , in the presence of 
adequate amounts of dNTPs and the four 
terminators . 

The inventors discovered that certain 
modifications of the amino acid sequence of DNA 
polymerases (i.e., modifying the amino acids at 
positions 342-344 to substitute threonine, proline 
and leucine, respectively, for leucine, glutamate 
and glutamate, and modifying the amino acid at 
position 422 to substitute tyrosine for 
phenylalanine, as corresponding to the amino acid 
sequence of Bst 320 DNA polymerase) result in a 
marked reduction of the innate selective 
discrimination against incorporation of 
fluorescent dye- labeled dideoxynucleotide ddCTP 
and ddATP, that is characteristic of the 
unmodified DNA polymerase. This reduction of 
selective discrimination can be demonstrated by 
direct automated fluorescent terminator DNA 
sequencing as recovered missing or ambiguous "C" 
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10 



15 



20 



25 



peaks and/or "A" peaks in the automated sequencing 
results, using a known template as standard. 



polymerase is Bacillus stearothermophilus because 
DNA polymerase isolated from this mesophilic 
bacillus is highly processive, can be used and 
stored in dried down form, has an optimum 
temperature at 65 °C, and can be used for direct 
DNA sequencing without precycling. The preferred 
Bst DNA polymerase is that isolated from strain 
320 with an amino acid sequence as follows: 

Amino acid sequence (SEQ ID NO: 2) : 
AEGEKPLEEM EFAIVDVITE EMLADKAALV VEVMEENYHD 
APIVGIALVN EHGRFFMRPE TALADSQFLA WLADETKKKS 
MFDAKRAWA LKWKGIELRG VAFDLLLAAY LLNPAQDAGD 
IAAVAKMKQY EAVRSDEAVY GKGVKRSLPD EQTLAEHLVR 
KAAAI WALEQ PFMDDLRNNE QDQLLTKLEH ALAAILAEME 
FTGVNVDTKR LEQMGSELAE QLRAIEQRIY ELAGQEFNIN 
SPKQLGVILF EKLQLPVLKK TKTGYSTSAD VLEKLAPHHE 
IVENILHYRQ LGKLQSTYIE GLLKWRPDT GKVHTMFNQA 
LTQTGRLSSA EPNLQNIPIR LEEGRKIRQA FVPSEPDWLI 
FAADYSQIEL RVLAHIADDD NLIEAFQRDL DIHTKTAMDI 
FQLSEEEVTA NMRRQAKAVN FGIVYGISDY GLAQNLNITR 
KEAAE FIERY FASFPGVKQY MENIVQEAKQ KGYVTTLLHR 
RRYLPDITSR NFNVRSFAER TAMNTPIQGS AADIIKKAMI 
DLAARLKEEQ LQARLLLQVH DELI LEAP KE EIERLCELVP 
EVMEQAVTLR VPLKVDYHYG PTWYDAK 



The characters represent the following amino 
acids : 



The preferred source for the DNA 



A 



where, 
alanine (Ala) 



M 



methionine (Met) 



C 



cysteine (Cys) 



N 



asparagine (Asn) 
proline (Pro) 
glutamine (Gin) 



D 



aspartic acid (Asp) 
glutamic acid (Glu) 



P 



E 



Q 
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F: 


phenylanaline (Phe) 


R: 


arginine (Arg) 


G: 


glycine (Gly) 


S: 


serine (Ser) 


H: 


histidine (His) 


T: 


threonine (Thr) 


I : 


isoleucine (lie) 


V: 


valine (Val) 


K: 


lysine (Lys) 


W: 


tryptophan (Trp) 


L: 


leucine (Leu) 


Y: 


tyrosine (Tyr) 



The Bst 32 0 DNA polymerase is 
characterized by possessing a proofreading 3' -5' 
exonuclease activity. 

10 The nucleotide sequence encoding the 

unmodified Bst 320 DNA polymerase is indicated in 
SEQ ID NO:l, in Example 2 below. 

The following amino acid sequence 
represents the modified Bst 320 DNA polymerase as 

15 the preferred embodiment of this invention, 

modified from the naturally- occurring Bst 320 DNA 
polymerase at positions 342-344 to substitute 
threonine, proline and leucine, respectively, for 
leucine, glutamate and glutamate, and at position 

20 422 to substitute tyrosine for phenylalanine. 

Amino acid sequence (SEQ ID: No 4): 
^4AEGEKPLEEMEFAIVDVITEEMLADKAALVVEVMEENYHDAPIV 
GIALvTSTEHGRFFMRPETALADSQFLAWLADETKKKSMFDAKRAVV 
ALKWKGIELRGVAFDLLLAAYLLNPAQDAGDIAAVAKMKQYEAVR 

2 5 SDEAVYGKGVKRSLPDEQTLAEHLVRKAAAI WALEQPFMDDLRNN 

EQDQLLTKLEHALAAILAEMEFTGVNVDTKRLEQMGSELAEQLRA 
I EQRI YELAGQEFNINS PKQLGVILFEKLQLPVLKKTKTGYSTSA 
DVLEKLAPHHEIVENILHYRQLGKLQSTYIEGLLKVVRPDTGKVH 
TMFNQALTQTGRLS S AE PNLQN I P I RTPLGRKI RQAF VP S E PDWL 

30 I FAADYSQ I ELRVLAH I ADDDNL I EAFQRDLDI HTKTAMDI FQLS 

EEE VTANMRRQAKAVNYG I VYG I SD YGLAQNLN I TRKEAAE FIER 
YFAS FPGVKQ YMENI VQEAKQKG YVTTLLHRRRYLPD I TS RNFNV 
RS FAERTAMNTP I QGSAAD 1 1 KKAMI DLAARLKEEQLQARLLLQV 
HDELILEAPKEEIERLCELVPEVMEQAVTLRVPLKVDYHYGPTWY 

35 DAK 
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The underlined amino acids are substituted amino 
acids produced by site-directed mutation of the 
naturally-occurring Bst 320 DNA polymerase. 

. . The modified Bst 320 DNA polymerase is 

5 encoded by a DNA sequence such as the following 

(SEQ ID NO: 3) : 

ATG 

GCCGAAGGGG AGAAACCGCT TGAGGAGATG GAGTTTGCCA 
TCGTTGACGT CATTACCGAA GAGATGCTTG CCGACAAGGC 

10 AGCGCTTGTC GTTGAGGTGA TGGAAGAAAA CTACCACGAT 

GCCCCGATTG TCGGAATCGC ACTAGTGAAC GAGCATGGGC 
GATTTTTTAT GCGCCCGGAG ACCGCGCTGG CTGATTCGCA 
ATTTTTAGCA TGGCTTGCCG ATGAAACGAA GAAAAAAAGC 
ATGTTTGACG CCAAGCGGGC AGTCGTTGCC TTAAAGTGGA 

15 AAGGAATTGA GCTTCGCGGC GTCGCCTTTG ATTTATTGCT 

CGCTGCCTAT TTGCTCAATC CGGCTCAAGA TGCCGGCGAT 
ATCGCTGCGG TGGCGAAAAT GAAACAATAT GAAGCGGTGC 
GGTCGGATGA AGCGGTCTAT GGCAAAGGCG TCAAGCGGTC 
GCTGCCGGAC GAACAGACGC TTGCTGAGCA TCTCGTTCGC 

2 0 AAAGCGGCAG CCATTTGGGC GCTTGAGCAG CCGTTTATGG 

ACGATTTGCG GAACAACGAA CAAGATCAAT TATTAACGAA 
GCTTGAG CAC GCGCTGGCGG CGATTTTGGC TGAAATGGAA 
TTCACTGGGG TGAACGTGGA TACAAAGCGG CTTGAACAGA 
TGGGTTCGGA GCTCGCCGAA CAACTGCGTG CCATCGAGCA 
25 GCGCATTTAC GAGCTAGCCG GCCAAGAGTT CAACATTAAC 

TCACCAAAAC AGCTCGGAGT CATTTTATTT GAAAAG CTGC 
AGCTACCGGT GCTGAAGAAG ACGAAAACAG GCTATTCGAC 
TTCGGCTGAT GTGCTTGAGA AGCTTGCGCC GCATCATGAA 
ATCGTCGAAA ACATTTTGCA TTACCGCCAG CTTGGCAAAC 

3 0 TGCAATCAAC GTATATTGAA GGATTGTTGA AAGTTGTGCG 

CCCTGATACC GGCAAAGTGC ATACGATGTT CAACCAAGCG 
CTGACGCAAA CTGGGCGGCT CAGCTCGGCC GAGCCGAACT 
TGCAAAACAT TCCGATTCGG ACCCCACTG G GGCGGAAAAT 
CCGCCAAGCG TTCGTCCCGT CAGAGCCGGA CTGGCTCATT 
3 5 TTCGCCGCCG ATTACTCACA AATTGAATTG CGCGTCCTCG 

CCCATATCGC CGATGACGAC AATC TAATTG AAGCGTTCCA 
ACGCGATTTG GATATTCACA CAAAAACGGC GATGGACATT 
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TTCCAGTTGA GCGAAGAGGA AGTCACGGCC AACATGCGCC 
GCCAGGCAAA GGCCGTTAAC TACGGTATCG TTTACGGAAT 
TAGCGATTAC GGATTGGCGC AAAACTTGAA CATTACGCGC 
AAAGAAGCTG CCGAATTTAT CGAACGTTAC TTCGCCAGCT 
TTCCGGGCGT AAAGCAGTAT ATGGAAAACA TAGTGCAAGA 
AGCGAAACAG AAAGGATATG TGACAACGCT GTTGCATCGG 
CGCCGCTATT TGCCTGATAT TACAAGCCGC AATTTCAACG 
TCCGCAGTTT TGCAGAGCGG ACGGCCATGA ACACGCCAAT 
TCAAGGAAGC GCCGCTGACA TTATTAAAAA AGCGATGATT 
GATTTAGCGG CACGGCTGAA AGAAGAGCAG CTTCAGGCTC 
GTCTTTTGCT GCAAGTGCAT GACGAGCTCA TTTTGGAAGC 
GCCAAAAGAG GAAATTGAGC GATTATGTGA GCTTGTTCCG 
GAAGTGATGG AGCAGGCCGT TACGCTCCGC GTGCCGCTGA 
AAGTCGACTA CCATTACGGC CCAACATGGT ATGATGCCAA 
A 

The characters represent the following 
nucleotides : 

A: Adenosine T: Thymidine 

C: Cytidine G: Guanosine 

The underlined nucleotides are 
substituted nucleotides produced by site-directed 
mutation of the naturally-occurring Bst 320 
polymerase . (As would be apparent to someone 
skilled in this art, this DNA sequence does not 
indicate the starting codon.) 

The invention also contemplates any DNA 
sequence that is complementary to the modified Bst 
320 DNA sequence, for instance, DNA sequences that 
would hybridize to the above DNA sequence of the 
modified DNA polymerase under stringent 
conditions. As would be understood by someone 
skilled in the art, the invention also 
contemplates any DNA sequence that encodes a 
peptide having these characteristics and 
properties (including degenerate DNA code) . 

This invention also contemplates allelic 
variations and mutations (for instance, adding or 
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deleting nucleotide or amino acids, sequence 
recombination or replacement or alteration) which 
result in no substantive change in the function of 
the. DNA polymerase or its characteristics. For 
instance, the DNA polymerases encompass non- 
critical substitutions of nucleotides or amino 
acids that would not change functionality (i.e., 
such as those changes caused by a transformant 
host cell) . In addition, the invention is 
intended to include fusion proteins and muteins of 
the unique DNA polymerases of this invention. 

The DNA sequences and amino acid 
sequences for the modified DNA polymerase of this 
invention are also obtainable by, for instance, 
isolating and purifying DNA polymerase from a 
Bacillus stearothermophilus , or a bacterial strain 
otherwise derived from Bacillus 

stearothermophilus, or other mesophilic bacillus 
strains such as Bacillus caldotenax or Bacillus 
caldolyticus . The DNA polymerases obtained from 
these organisms may be easily modified using 
conventional DNA modification techniques to 
achieve the reduction in fluorescent dye- labeled 
ddCTP and ddATP selective discrimination, as long 
as the unmodified amino acid sequences have 
leucine-glutamate-glutamate at positions 
corresponding respectively to positions 342-344 of 
Bst 320 DNA polymerase and phenylalanine at a 
position corresponding to position 422 of Bst 320 
DNA polymerase. For instance, using the primers 
and methods of screening described herein, someone 
skilled in the art could isolate a DNA polymerase 
having the same properties and function from other 
strains . 

In the DNA polymerases currently used in 
conventional DNA sequencing protocols, it is 
preferred that the enzymes have low or no 
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exonuclease activity. However, in this invention, 
it is preferred that the DNA polymerases have a 
function of high fidelity ("HiFi") nucleotide 
incorporation-. Therefore, in one preferred 
embodiment the invention entails modification of a 
naturally-occurring Bst DNA polymerase having a 
proofreading 3' -5' exonuclease activity. This 
preferred modified DNA polymerase (e.g., "HiFi 
Bst -II") has a nucleotide sequence indicated in 
SEQ ID: NO 3 and an amino sequence indicated in SEQ 
ID:NO 4. To initially obtain a Bst DNA polymerase 
having proofreading 3' -5' activity, strains of 
Bacillus stearothermophilus can be segregated into 
different groups according to the proof-reading 
exonuclease activity of their respective DNA 
polymerases . 

The invention also provides a DNA 
construct comprising at least one of the DNA 
sequences of the modified DNA polymerase and a 
vector (such as a cloning vector or an expression 
vector) , for introducing the DNA construct into 
host cells. An example of a suitable vector is 
pYZ2 3/LF, described below. 

The host cells need only be capable of 
being stably transformed with the DNA construct in 
a manner allowing production of the peptide 
encoded by the DNA segment in the construct 
(preferably in large quantity) . The host cells 
may be of eucaryotic or procaryotic origin (such 
as a E_;_ coli host cell) . For instance, the host 
cell may be a mesophilic organism, although this 
is not a necessary requirement in order that a 
host cell be effective. 

The invention also provides improved 
methods for DNA sequencing using the above - 
described novel DNA polymerases. The methods 
entail sequencing a DNA strand by conventional 
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protocols with the following modifications: 

i) hybridizing a primer to a DNA template to 
be sequenced; 

. . ii) extending the primer using a DNA 
5 polymerase described above, in the presence of 

radiolabeled dATP, nucleotides dGTP, dCTP and 
dTTP, or their analogs, and ddNTP chain 
terminators and 

iii) allowing a DNA strand to be sequenced. 

10 All four dNTPs, including dCTP, are 

incorporated equally effectively in the chain 
elongation during sequencing reaction catalyzed by 
the DNA polymerases of the invention with a high 
processivity and a high elongating rate. 

15 Preferably the nucleotide premix 

concentrations of modified Bst DNA polymerase used 
in radiolabeled DNA sequencing are as following: 
A mix: dATP 0.8/iM, dCTP 80jiM, dGTP 80/iM, dTTP 
80/iM, ddATP 25/iM; 

20 C mix: dATP 0.8/iM, dCTP 8/iM, dGTP 80/iM, dTTP 

80/iM, ddCTP 20/iM; 

G mix: dATP 0.8/iM, dCTP 80/iM, dGTP 8/iM, dTTP 
80/iM, ddGTP 50/iM; 

T mix: dATP 0.8/xM, dCTP 80/iM, dGTP 80/iM, dTTP 
25 8/iM, ddTTP 50//M. (This mixture is useful for the 

particular modified Bst 320 DNA polymerase set 
forth above, as well as for other modified Bst DNA 
polymerases . ) 

In addition, the invention contemplates 
30 other uses of the modified DNA polymerases. For 

instance, the DNA polymerase can also be use in 
(1) filling- in 5' overhangs of DNA fragments; (2) 
synthesis of DNA probes by random primers labeling 
methodology; and (3) site-directed mutagenesis. 
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illustrative of the invention. 

Example 1 : Screening for Bst polymerases 

This invention also involves a method to 
measure the proof-reading 3' -5' exonuclease 
activity of purified DNA polymerases. The method 
is useful to screen a large number of bacterial 
strains, such as Bacillus stearothermophilus and 
other mesophilic bacterial strains, to select a 
strain which produces a DNA polymerase with a high 
proof-reading 3' -5' exonuclease activity. For 
instance, the method to test the proof-reading 3' - 
5' exonuclease activity of DNA polymerase was 
carried out as follows. 

A DNA primer and two DNA templates with 
following sequences were synthesized chemically, 
using a DNA synthesizer. 

17 -base primer 5 r CATTTTGCTGCCGGTCA 3' 

1 mg/ml 
(SEQ ID NO: 5) 

Template (a) 3 ' GTAAAACGACGGCCAGTCTT 5 ' 

10 mg/ml 
{SEQ ID NO:6) 

Template (b) 3 ' GTAAAACGACGGCCAGTCGG 5 ' 

10 mg/ml 
(SEQ ID NO: 7) 
, To produce the radiolabeled primer, 1 jil 
(1 tig) of primer, 5 jul (50 ^g) of template (a), 1 
/il of [a- 32 P] dATP (800 Ci/mmole) , 1 pi of dGTP 
(0.5 mM) , 1 (xl of Taq DNA polymerase (1 unit), and 
1 pi of buffer consisting of 500 mM Tris-Cl, pH 
9.0, and 150 mM MgCl 2 , were mixed in a test tube 
and incubated in a 65 *C water bath for 5 minutes. 
The mixture was subject to alkaline denaturing gel 
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electrophoresis. The radioactive band containing 
the 20-base nucleotide was isolated and dissolved 
in 12 y.1 of 10 mM Tris-Cl buffer, containing 1 mM 
EDTA, pH 8.0. The final product represents the 
following labeled 20-base primer. 

5 ' CATTTTGCTGCCGGTCAGA* A * 3 ' 
(* = 32 P labeled) 
(SEQ ID NO: 8) 
To produce radiolabeled primer- template 
complexes, 5 jxl of the labeled primer was mixed 
with 10 fil of template (a) or template (b) 
respectively to form the following: 

Complex (a) 

5 ' CATTTTGCTGCCGGTCAGA*A* 3' {same as SEQ ID NO: 8) 
3 ' GTAAAACGACGGCCAGTCT T 5' (same as SEQ ID NO: 6) 

Complex (b) 

5' CATTTTGCTGCCGGTCAGA* A* 3' (same as SEQ ID NO: 8) 
3' GTAAAACGACGGCCAGTCG G 5' (same as SEQ ID NO: 7) 

The free radiolabeled primer was removed through a 
G-50 Sephadex column. 

An aliquot of complex (a) which had two 
correctly matched radiolabeled A*s at the 3 ' 
terminus of the primer, and an aliquot of complex 
(b) which had two mismatched A*s at the 3' 
terminus of the primer, were then pipetted into 
two individual vials of scintillation fluid and 
their radioactivity was measured in a 
scintillation counter, and both complexes were 
adjusted with buffer to a concentration containing 
the same molarity of incorporated [a- 32 P] dAMP. 

To perform the proof-reading 3' -5' 
exonuclease activity, 20 /il of complex (a) or 
complex (b) , 8 fil reaction buffer consisting of 15 
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mM Tris-Cl and 15 mM MgCl 2 , pH 8.5, 4 units of DNA 
polymerase, and enough water to make up a total 
volume of 40 pi were pipetted into a test tube and 
mixed well . The mixture was subdivided into 
aliquots of 3 /il each in 0.5 ml microcentrifuge 
tubes and was then covered with 3 /il paraffin in 
each tube . The microcentrifuge tubes were 
incubated in a 65 °C water bath. At l, 2, 3, 5, 10, 
and 2 0 minutes, a pair of the microcentrifuge 
tubes were taken out from the water bath and the 
content of each tube was dotted onto a DE-81 
Whatman filter paper. One of each pair of the 
filter papers was put in scintillation fluid 
directly and the radioactivity was counted in cpm 
value in a scintillation counter; the other was 
washed three times in 0.3 M sodium phosphate 
buffer, pH 6.8 before being put into the 
scintillation fluid for counting. 

The difference in radioactivity 
expressed in cpm value between the washed filter 
paper and the unwashed filter paper in each pair 
was interpreted as representing the relative 
quantity of labeled nucleotides excised by the 3'- 
5' exonuclease activity from the 3' terminus of 
the primer. A DNA polymerase that excised the 
radiolabeled nucleotides A*s from complex (b) more 
efficiently than from complex (a) possessed proof- 
reading 3 ' -5 'exonuclease activity. A DNA 
polymerase that excised the radiolabeled 
nucleotides A*s from complex (a) faster than from 
complex (b) , or at nearly the same rate, was 
interpreted as possessing a non-specific 3' -5' 
exonuclease activity which is considered 
unsuitable for DNA sequencing. 

Using these methods, a strain of 
bacteria was isolated from among the strains of 
Bacillus stearothermophilus from various sources 
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which is distinguished in its fast growth rate. 
This strain reached an optimum exponential growth 
within 3 hours for DNA polymerase production. The 
strain was also able to produce a DNA polymerase 
with a proof-reading 3' -5' exonuclease activity. 
This strain of Bacillus stearothermophilus was 
labeled Bst No. 320. 

As persons skilled in this art would 
appreciate, the bacterial strain, or even the 
strain of Bacillus stearothermophilus , from which 
a mesophilic DNA polymerase of the invention can 
be obtained may be derived using the above - 
described methods or others known in the art from 
strains of Bacillus stearothermophilus or other 
bacteria strains {especially mesophilic strains) , 
including wild strains or mutant strains acquired 
by various means, including spontaneous mutation. 

To prepare the preferred purified Bst 
DNA polymerase, the cells of Bst No. 320 were 
grown at 55°C in a liquid medium consisting of 1% 
polypeptone, 0.5% yeast extract and 0.5% NaCl , 
pH7.0-7.2. The 3 hr old cells were collected after 
centrif ugation and suspended in 4 volumes of TME 
buffer (50mM Tris-HCl, pH7 . 5 , lOmM (3- 
mercaptoethanol, and 2 mM EDTA) , containing 100 mg 
lysozyme and 23 mg phenylmethyl sulphonyl - 
fluoride/ml. The cells were broken by sonication 
in ice. The supernatant was pooled after 
centrif ugation at 28,000 rpm in a Spinco L 30 
rotor . 

The purified Bst DNA polymerase of the 
invention was prepared according to Okazaki and 
Kornberg (7) with appropriate slight modifications 
and the large fragment of the DNA polymerase was 
obtained by partial digestion of the whole DNA 
polymerase with the proteinase subtilisin (type 
Carlsberg) basically according to Jacobsen et al . 
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(8) . 

The procedure for purification of enzyme 
was followed as described in Ye and Hong (4) . 
This Bst DNA polymerase possessed a proof-reading 
3 '-5' exonuc lease activity. 

The Bst polymerase was tested for proof- 
reading and non-specific 3 ' -5 ' exonuclease 
activities as described above. The results showed 
that the polymerase excised the mismatched 
incorporated nucleotides from the 3' terminus of a 
double -stranded DNA at a high rate, reaching the 
plateau of hydrolysis in about 3 minutes, about 8 
times more efficiently in the first 3 minutes of 
reaction than those correctly matched with the 
nucleotides of the template. This enzyme is 
referred to herein as HiFi Bst DNA polymerase, and 
is distinguishable from Bst DNA polymerases 
isolated from other strains of Bacillus 
s tearo thermophi 1 us . 

This prodedure of using HiFi Bst as the 
DNA polymerase in the classic radiolabeling Sanger 
reaction for DNA sequencing and its autoradiograph 
are illustrated in Example 8. The data obtained 
by adapting this procedure to use HiFi Bst as the 
DNA polymerase in dye -labeled primer automated 
fluorescent DNA sequencing are illustrated in 
Example 9. These results indicate that HiFi Bst 
DNA polymerase can be used for the classic Sanger 
manual sequencing and the fluorescent dye- labeled 
primer sequencing with high processivity and high 
fidelity. 

However, when the protocol was modified 
for fluorescent dye- labeled ddNTP terminator 
automated DNA sequencing, numerous "C" peaks and 
W A" peaks were missing or appeared ambiguous in 
the sequence plot, especially when the W C" peak 
was immediately after a "G" peak or after an W A" 
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peak, and when the "A" peak was immediately after 
a M G" peak. This selective discrimination against 
incorporation of dye-labeled ddCTP and dye-labeled 
ddATP is sequence- related since many "C" and "A" 
peaks following an immediate "C" peak or an 
immeciate W T" peak remained strong and correctly 
resolved in the same color plot of sequence 
analysis. (See Example 9) This phenomenon of 
selective discrimination against incorporation of 
fluorescent dye -labeled ddCTP and dye- labeled 
ddATP is observed with all DNA polymerases 
isolated from different strains of Bacillus 
stearothermophilus and Bacillus caldotenax, and 
appears to be characteristic of DNA polymerases of 
the mesophilic bacilli. 



Example 2 : Mutation of the gene for naturally- 
occurring Bst DNA polymerase having proofreading 
3 '-5' exonuclease activity 

The DNA fragment LF containing the gene 
initially isolated from the wild Bst 320 has the 
following sequence (see SEQ ID NO:l) : 



DNA sequence (isolated, 

GCCGAAGGGG AGAAACCGCT 

TCGTTGACGT CATTACCGAA 

AGCGCTTGTC GTTGAGGTGA 

GCCCCGATTG TCGGAATCGC 

GATTTTTTAT GCGCCCGGAG 

ATTTTTAGCA TGGCTTGCCG 

ATGTTTGACG CCAAGCGGGC 

AAGGAATTGA GCTTCGCGGC 

CGCTGCCTAT TTGCTCAATC 

ATCGCTGCGG TGGCGAAAAT 

GGTCGGATGA AGCGGTCTAT 

GCTGCCGGAC G AACAGACG C 



mrif ied) : 

TGAGGAGATG GAGTTTGCCA 

GAGATGCTTG CCGACAAGGC 

TGGAAGAAAA CTACCACGAT 

ACTAGTGAAC GAGCATGGGC 

ACCGCGCTGG CTGATTCGCA 

ATGAAACGAA GAAAAAAAGC 

AGTCGTTGCC TTAAAGTGGA 

GTCGCCTTTG ATTTATTGCT 

CGGCTCAAGA TGCCGGCGAT 

GAAACAATAT GAAGCGGTGC 

GGCAAAGGCG TCAAGCGGTC 

TTGCTGAGCA TCTCGTTCGC 
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AAAGCGGCAG 


CCATTTGGGC 


GCTTGAGCAG 


CCGTTTATGG 




ACGATTTGCG 


GAACAACGAA 


CAAGATCAAT 


TATTAACGAA 




GCTTGAGCAC 


GCGCTGGCGG 


CGATTTTGGC 


TGAAATGGAA 




TTCACTGGGG 


TGAACGTGGA 


TACAAAGCGG 


CTTGAACAGA 


5 


TGGGTTCGGA 


GCTCGCCGAA 


CAACTGCGTG 


CCATCGAGCA 




GCGCATTTAC 


GAGCTAGCCG 


GCCAAGAGTT 


CAACATTAAC 




TCACCAAAAC 


AGCTCGGAGT 


CATTTTATTT 


GAAAAGCTGC 




AGCTACCGGT 


GCTGAAGAAG 


ACGAAAACAG 


GCTATTCGAC 




TTCGGCTGAT 


GTGCTTGAGA 


AGCTTGCGCC 


GCATCATGAA 


10 


ATCGTCGAAA 


ACATTTTGCA 


TTACCGCCAG 


CTTGGCAAAC 




TGCAATCAAC 


GTATATTGAA 


GGATTGTTGA 


AAGTTGTGCG 




CCCTGATACC 


GGCAAAGTGC 


ATACGATGTT 


CAACCAAGCG 




CTGACGCAAA 


CTGGGCGGCT 


CAGCTCGGCC 


GAGCCGAACT 




TGCAAAACAT 


TCCGATTCGG 


CTCGAAGAGG 


GGCGGAAAAT 


15 


CCGCCAAGCG 


TTCGTCCCGT 


CAGAGCCGGA 


CTGGCTCATT 




TTCGCCGCCG 


ATTACTCACA 


AATTGAATTG 


CGCGTCCTCG 




CCCATATCGC 


CGATGACGAC 


AATCTAATTG 


AAGCGTTCCA 




ACGCGATTTG 


GATATTCACA 


CAAAAACGG C 


GATGGACATT 




TTCCAGTTGA 


G CGAAGAGG A 


AGTCACGGCC 


AACATGCGCC 


20 


GCCAGGCAAA 


GGCCGTTAAC 


TTCGGTATCG 


TTTACGGAAT 




TAGCGATTAC 


GGATTGGCGC 


AAAACTTGAA 


CATTACGCGC 




AAAGAAGCTG 


CCGAATTTAT 


CGAAC GTTAC 


TTCGCCAGCT 




TTCCGGGCGT 


AAAGCAGTAT 


ATGGAAAACA 


TAGTGCAAGA 




AGCGAAACAG 


AAAGGATATG 


TGACAACGCT 


GTTGCATCGG 


25 


CGCCGCTATT 


TGCCTGATAT 


TACAAGCCGC 


AATTTCAACG 




TCCGCAGTTT 


TGCAGAGCGG 


ACGGCCATGA 


ACACGCCAAT 




TCAAGGAAGC 


GCCGCTGACA 


TTATTAAAAA 


AGCGATGATT 




GATTTAGCGG 


CACGGCTGAA 


AGAAGAGCAG 


CTTCAGGCTC 




GTCTTTTGCT 


GCAAGTGCAT 


GACGAGCTCA 


TTTTGGAAGC 


30 


GCCAAAAGAG 


GAAATTGAGC 


GATTATGTGA 


GCTTGTTCCG 




GAAGTGATGG 


AGCAGGCCGT 


TACGCTCCGC 


GTGCCGCTGA 




AAGTCGACTA 


CCATTACGGC 


CCAACATGGT 


ATGATGCCAA 




ATAA (1764 nucleotides total) 





Site directed mutagenesis was performed 
3 5 as described by Kunkel et al.(l4) The DNA fragment 

(designated "LF") containing the gene for Bst DNA 
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polymerase having proofreading exonuclease 
activity was cloned from the expression vector 
pYZ23/LF into plasmid pUC119. The constructed 
plasmid pUC119/LF was then transformed into E. 
coli CJ23 6, a mutant of E. coli that lacks the 
enzymes dUTPase and uracil N-glycosylase . 
Therefore, when grown in a medium supplemented 
with uridine, this mutant of E. coli as well as 
the plasmids in the cells will incorporate 
deoxyuridine into the DNA in place of thymidine 
and the uracils will not be removed readily. 

As the constructed plasmid grew in the 
cells of E. coli CJ236 and in the presence of 
uracil and M13K07 helper phage, some of the normal 
thymidine bases of the DNA in the newly produced 
single -stranded pUC119/LF were replaced by 
uracils. These uracil -containing DNAs were used 
as the template in vitro for the production of a 
complementary oligonucleotide that contained the 
desired DNA sequence alteration, but with only 
dTMPs and not dUMP residues. 

In practice, the expression vector 
pYZ23/LF was digested with restriction enzymes Eco 
RI and Bam HI , and the DNA fragment LF was 
separated and cloned into plasmid pUC119 which had 
been previously digested with the same restriction 
enzymes. The constructed plasmid pUC119/LF was 
then transformed into E^ coli CJ236. For gaining 
the uracil -containing single -stranded pUC119/LF, a 
colony of E^_ coli CJ236 containing pUC119/LF was 
selected and inoculated into 2 ml of 2 x YT medium 
which was supplemented with 0.25 ug/ml of uridine 
and 2 x 10 8 to 4 x 10 s pfu/ml of M13K07 as helper 
phage. After incubation at 37°C with strong 
agitation for 1 hour, a kanamycin solution (2 5 
mg/ml in H 2 0) was added to the culture to a final 
concentration of 70 ug/ml. The incubation was 
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allowed to continue for another 14-18 hours at 
37°C with strong agitation. Then 1.5 ml of the 
infected culture was transferred to a 
microcentrifuge tube, and centrifuged at 12,000 x 
g for 5 minutes at 4°C. The uracil -containing 
single-stranded pUC119/LF was precipitated and 
purified from the supernatant according to 
standard PEG/NaCl and ethanol procedures. 

After performing a series of 
experiments, the inventors found that the combined 
effects of changing the amino acids leucine- 
glutamate-glutamate {LEE} at the location 342-344, 
to respectively threonine-proline- leucine (TPL) , 
and the amino acid phenylalanine (F) at location 
422, to tyrosine (Y) in the peptide structure of 
HiFi Bst DNA polymerase markedly reduced its 
selective discrimination against incorporation of 
fluorescent dye-labeled ddCTP and dye-labeled 
ddATP to such a level that direct automated 
fluorescent DNA sequence (although not cycle- 
sequencing) can be performed with the dye- 
terminator technology when the mutated enzyme of 
the current invention is used. It is of interest 
to note that this modified HiFi Bst, now referred 
to as HiFi Bst- II DNA polymerase, exhibits the 
function of preferentially incorporating more 
fluorescent dye -labeled ddCTP and dye -labeled 
ddATP onto the 3' end dGMP and the dAMP bases of 
the extending DNA strands during enzymatic 
reaction, than the unmodified naturally occurring 
HiFi Bst polymerase. 

The end result is the recovery of the 
W C" and "A" peaks which otherwise would have been 
missing or ambiguous on the sequence analysis 
color plot. At the same time, the modified enzyme 
did not indiscriminately generate an excess amount 
of dye- labeled "G" terminated or dye- labeled W T" 
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terminated DNA fragments. Even the "C" and "A" 
peaks were not uniformly raised in a blanket 
manner, but only raised in the formerly depressed 
locations after a W G" and/or an "A" . (See Example 
9) . Thus, this genetic modification of the HiFi 
Bst to HiFi Bst-II results in a DNA polymerase 
that reduces the selective discrimination against 
incorporation of the fluorescent dye- labeled ddCTP 
and dye-labeled ddATP, rather than merely 
increases the ability of the parent enzyme to 
incorporate these dye-labeled dideoxynucleo tides . 

HiFi Bst -II, and the other novel similar 
DNA polymerases of this invention, can be used for 
the classic radiolabeling Sanger method. (See 
Example 8.) HiFi Bst-II appears to generate a 
better sequencing pattern than HiFi Bst and 
requires less ddNTPs to terminate the extending 
reaction (Figure 3) . For instance, in the 
optimized reaction mixture for the unmodified HiFi 
Bst DNA polymerase, the ddNTP/dNTP ratios in the 
A, C, G and T mix were 40, 6.25, 18.25 and 18.72, 
respectively. In the optimized reaction mixture 
for the modified HiFi Bst-II DNA polymerase, the 
corresponding ddNTP/dNTP ratios in the A, C, G and 
T mix were 40, 2.5, 6.25 and 6.25, respectively. 
Therefore, there was an up -to about three -fold 
reduction in the amount of ddNTPs used after 
genetic modification of the naturally-occurring 
DNA polymerase . 

For the radiolabeling classic Sanger 
method of DNA sequencing, the optimized reaction 
mixtures for either HiFi Bst or HiFi Bst-II must 
contain much more ddNTPs than dNTPs to generate a 
ladder of DNA fragments for sequencing analysis 
because the DNA polymerases of the mesophilic 
bacilli tend to incorporate dNTPs more efficiently 
than ddNTPs. The above -described genetic 
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modification appears to increase the ability of 
the naturally-occurring enzymes to incorporate 
ddNTP in the presence of a corresponding competing 
dNTP. to about three -fold at the concentration 
5 ratios commonly used for DNA sequencing. However, 

if much higher concentrations of the nucleotides 
were used for the experiment, and the ddNTP/dNTP 
ratio was reduced to a level that is suboptimal 
for DNA sequencing {for instance at a ratio of 

10 1/3) , the increased ability for incorporating 

ddNTPs after modification of the enzyme could be 
dramatized. (See Example 6, Figure 2) . 

Similar to the results obtained with 
radiolabeling Sanger method, both HiFi Bst and 

15 HiFi Bst -II can be adapted for fluorescent dye- 

labeled primer automated DNA sequencing and 
produce comparable results without selective 
suppression of any specific fluorescent peaks in 
the sequencing plot (see Example 9) although the 

20 peaks generated by HiFi Bst- II appear to be more 

even than those by HiFi Bst. 

To change amino acids leucine, glutamic 
acid and glutamic acid (LEE) at positions 342-344, 
respectively in the Bst polymerase into threonine, 

2 5 proline and leucine (TPL) , respectively, Primer 1 

was designed as following (see SEQ ID NO 10) : 
5' - CATTCCGATTCGG ACCCCACTG GGGCGGAAAATCCG - 3 

To change amino acid phenylalanine (F) 
at position 422 in the Bst DNA polymerase into 

30 tyrosine ( Y) , Primer 2 was designed as following 

(see SEQ ID NO: 9) : 

5 ' - GCCGTTAACTACGGTATCGTTTACGG - 3 ' 

After phosphorylation of the 5' ends of 
the oligonucleotides by T4 polynucleotide kinase, 

35 the two primers designed above were annealed to 

the single-stranded uracil -containing pUC119/LF 
purified from above. In the presence of the usual 
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dNTPs (dATP , dCTP, dGTP and dTTP) , T4 DNA 
polymerase was used to synthesize in vitro the 
strands of DNA complementary to the uracil - 
containing pUC119/LF template, and T4 ligase was 
used to ligate the synthesized strands to form a 
complete double -stranded plasmid which was 
composed of one single -stranded, not mutagenic, 
uracil -containing pUC119/LF and one complementary 
single -stranded, mutagenic, thymidine -containing 
DNA fragment that had been altered by primer 1 and 
primer 2 described above . These newly formed 
double-stranded plasmids were then transformed 
into E. coli JM109. The template strand was 
rendered biologically inactive. The transformed 
strain of E. coli JM109 whose plasmids contained 
the mutated DNA, now referred to as pUC119/LF-M, 
was screened out with DNA sequencing of its 
plasmids . 

Example 3 : Cloning and expression of the modified 
Bst DNA polymerase having both ability to reduce 
selective dye-labeled ddNTP discrimination and 
proofreading 3' -5' exonuclease activity 

The plasmid pUC119/LF-M was prepared 
from the strain of Escherichia coli JM109 
containing the mutated DNA. The mutated DNA 
fragment (LF-M) containing the mutated gene for 
the Bst polymerase was recombined back into the 
expression vector pYZ23 . The constructed plasmid 
pYZ23/LF-M was then transformed into Escherichia 
coli JF1125. The mutation was further confirmed by 
double -stranded dideoxy DNA sequencing of isolated 
plasmid. 

The strain of Escherichia coli JF1125 
containing pYZ23/LF-M was inoculated into LB 
culture containing 100^g/ml ampicillin, and was 
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incubated overnight at 30°C. The overnight culture 
was inoculated into a large volume of fresh 
culture, and was incubated at 3 0°C until the OD 600 
of the culture reached 0.7. The culture was then 
heated at 41 °C for 3 hours for induction. The SDS- 
PAGE analysis of the cell extract showed that the 
cloned mutated gene for the modified Bst DNA 
polymerase was overexpressed . 

Example 4 : Isolation and purification of the 
modified Bst DNA polymerase having both ability to 
reduce selective dye- labeled ddNTP discrimination 
and proofreading 3' -5' exonuclease activity 

The expressed cells of Escherichia coli 
JF1125 containing pYZ23/LF-M grown in condition as 
described above were thawed and washed with buffer 
[lOmM Tris-HCl (pH7 . 5 at room temperature), lOmM {3~ 
Mercaptoethanol , 2 mM EDTA, 0.9% NaCl] . The 
pellets were then suspended in buffer [50mM Tris- 
HCl {pH7.5 at room temperature), lOmM (3- 
Mercaptoethanol, 2mM EDTA, 100/ig/ml Lysozyme, 
23jig/ml PMSF] (4ml /g pellet) . After 20 min at room 
temperature, the mixture was cooled on salt -ice 
and sonicated briefly to complete lysis. The cell 
extract obtained by centrifugation at 18,000rpm at 
4°C for 2 0 minutes, was then treated step by step 
as follows: 

(A) The cell extract was heated at 60 °C for 3 0 
minutes, and cooled to 4°C, then centrifuged at 
15,000rpm at 4°C for 20 minutes; 

(B) 5% Polymin P was added into supernatant to 
0.6%, and mixed quickly for 3 0 minutes, then 
centrifuged; 

(C) The pellet was resuspended in Buffer A[50mM 
Tris-HCl (pH7 . 5 at room temperature), ImM EDTA, ImM 
{3 -Mercaptoethanol] containing 800mM NaCl and 5% 
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Glycerol at 4°C, and then centrifuged; 

(D) Ammonium sulfate was added into the 

supernatant to 60% saturation at 4°C / and 
.mixed for 3 0 minutes, then centrifuged; 

(E) The ammonium sulfate pellet was resuspended in 
30ml of 60% saturated ammonium sulfate at 4°C, and 
then recent rifuged; 

(F) The pellet was suspended in Buffer A 
containing lOOmM KC1 and dialysed against the same 
buffer for hours at 4°C, then centrifuged. The 
insoluble protein was discard; 

(G) The supernatant was added to pass through a 
DE-52 column. The column was washed, and the peak 
DNA polymerase was eluted using a 100-600 mM KC1 
linear gradient in Buffer A, concentrated in 
Buffer A containing 50% (w/v) PEG-6000, dialyzed in 
Buffer A containing 100 mM KC1 ; 

(H) The solution was then applied to Heparin- 
Sepharose CL-4B column. The peak DNA polymerase 
was eluted with a linear gradient of 100-800 mM 
KC1 in Buffer A, concentrated and finally dialyzed 
in buffer A containg 50% glycerol. 

The resulting modified Bst DNA polymerase 
has been proved to be homogenous by polyacryl amide 
gel electrophoresis . And the enzyme obtained was 
stored in -20°C. 



Example 5: Determination of the thermostability 
of unmodified Bst DNA polymerase and modified Bst 
DNA polymerase 

The DNA polymerases of Examples 1 and 5 
were incubated at 65°C for 0, 5, 10, 20, 30, 40, 
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50 minutes respectively, and placed into ice-water 
immediately. The polymerase activity of these DNA 
polymerases was determined at 60 °C. 

The polymerase activity of DNA 
5 polymerase was determined as follows: 

5 x Reaction Solution: 
1M Tris-HCl (pH7.6) 16.75 ml 

1M MgCl 2 1.675 ml 

1M jS-Mercaptoethanol 0.25 ml 

10 ddH 2 0 adjusted to 50 ml 

Reaction Storage: 

5 x Reaction Solution 60/il 

dNTPs (ImM each) 10/il 
l.Sfig/pl DNase I activated 

15 calf thymus DNA lOfil 

ddH 2 0 10/il 

a- 32 P-dATP appropriate amt . 



Reaction Mixture: 

Reaction Storage 30/il 

2 0 Sample 5/il 

ddH 2 0 GSfil 



The reaction mixtures were prepared as 
per the recipe above, and incubated at 60 °C for 3 0 
minutes. Then the reaction mixtures were pipetted 

25 onto DE-81 filters respectively. After all of the 

fluid has evaporated, the amount of radioactivity 
on each filter was measured with scintillation 
(XJ . The filters were washed three times with 
0.3M Na 2 HP0 4 at room temperature, 10 minutes each 

3 0 times, dried at room temperature and then the 

amount of radioactivity on each filter was 
measured again (X 2 ) . 

The polymerase activity of sample 

(u/ml)= 

46 
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[X 2 - X 2 7| * 266 
^1 x io| 

(X 10 and X 20 are the amount of radioactivity 
measured with water as control sample) 

Unit definition of polymerase activity: 
One unit is the amount of DNA polymerase required 
to incorporate 10 nanomoles of dNTPs into DNA in 
30 minutes at 60°C . 

The thermostability of DNA polymerase is 
expressed with the half life of polymerase 
activity at 65 °C. Figure 1 shows the comparison of 
thermostability of HiFi Bst and HiFi Bst-II. The 
half life of HiFi Bst at 65°C was 8.5 minutes, and 
that of HiFi Bst-II was 16 minutes. HiFi Bst-II 
was more thermostable than HiFi Bst. 

Example 6 : Demonstration of increased ddNTP 
incorporation by modified Bst DNA polymerase in 
suboptimal sequencing conditions 

The following procedure was followed: 

1. The -20M13 forward primer was 
radiolabeled using y- 32 P-ATP and T4 Polynucleotide 
kinase ; 

2. The following components were combined in 
a microcentrifuge tube: 

SxReaction Buffer 2 . 0/xl 

radiolabeled primer 1.0/xl (2.5ng) 

Template 7.0jzl (ljxg M13mpl8 ssDNA) 

The final volume was 10/il . The contents were mixed 

and spun for 2-3 seconds; 

3. The tube were placed in a 75°C water bath 
for 5 minutes. Then the tube was allowed to cool 
slowly to ambient temperature over a course of 10 
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minutes ; 

4. 1.0/il of modified Bst DNA polymerase (of 
Example 5) (lu/^1) was added. The mixture was 
mixed gently and spun for 2-3 seconds; 

5. 4 tubes were labelled "A", "C", "G", "T", 
respectively and 2/il of each premixed nucleotide 
solution and 2.5^/1 of main mixture (from step 3) 
was added to the respective reaction 

tube ; 

6. The tubes were incubated at 65°C for 15 
minutes ; 

7. The reactions were stopped by adding 4 . 0/xl 
of Stop Solution (95% deionized formamide, lOmM 
EDTA, 0.05% xylene cyanol FF, 0.05% bromophenol 
blue) to each tube; 

8. The samples were denatured at 90°C for 2 
minutes, and immediately placed on ice; 

9. 4-5/il of samples were loaded onto each 
lane of 6% (8M urea) sequencing gel, and 
electrophoresis was carried out. 

Note: The Components of the Premixed Nucleotide 
Solutions : 

A mix: dNTPs 120/iM, ddATP 40/iM 

C mix: dNTPs 120//M, ddCTP 40^M 

G mix: dNTPs 120/iM, ddGTP 40/zM 

T mix: dNTPs 120/iM, ddTTP 40/zM 

Figure 2 shows the comparison of ddNTP 
incorporation of HiFi Bst- I I DNA polymerase and 
HiFi Bst DNA polymerase. In this radiolabeling 
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DNA sequencing experiment, high concentrations of 
nucleotides were used in the reaction mixture and 
the ddNTP/dNTP ratio was reduced to a level (1/3) 
that .is lower than the optimal range for DNA 
sequencing. HiFi Bst-II is shown to have more 
effective ddNTP incorporation. The DNA synthesis 
was often terminated by ddNTP incorporation in the 
HiFi Bst-II mixture, and the result showed uniform 
bands with synthesized small or large DNA 
fragments. As a contrast, HiFi Bst had a lower 
ddNTP incorporation. The DNA synthesis by HiFi Bst 
was less terminated, and most of the synthesized 
products were the larger DNA fragments. 

Example 7: Preparation of Denatured Double - 
stranded DNA Template : 

The following procedure was carried out. 

1. Double -stranded DNA (about 3-5/ig) was 
adjusted to a final volume of lOptl with TE (lOmM 
Tris-HCl, lmM EDTA, pH8.0); 

2. 10/il of 0.4N NaOH, 0.4mM EDTA, was added; 

3. The mixture was incubated at 65 °C for 15 
minutes ; 

4. 2/il of 2M sodium acetate, pH4.5, and 55//1 
cold ethanol was added, and the mixture was placed 
in ice-water bath for 5 minutes ; 

5. The mixture was spun in a microcentrifuge 
at 4°C, 12500rpm for 5 minutes; 

6. The supernatant was drawn off and the 
pellet was washed with 200^1 of 70% ethanol; 
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7. The pellet was dried under vacuum for 2-3 
minutes, and the DNA was dissolved in appropriate 
solution. 

Example 8: DNA sequencing using unmodified Bst 
DNA polymerase/modif ied Bst DNA polymerase with 
radiolabeled dATP for single- or denatured double- 
stranded DNA template 

The following procedure was carried out. 

1. The following components were combined in 
a labeled microcentrifuge tube: 

SxReaction Buffer 2 . O/il 

Primer 1.0/xl (2.5-5.0ng) 

Template 7.0/xl (250-500ng ss DNA 

or 1-3 fig denatured ds DNA) 
The final volume was 10/il. The contents were mixed 
and spun for 2-3 seconds; 

2. The tube were placed in a 75 °C water bath 
for 5 minutes. Then the tubes were allowed to cool 
slowly to ambient temperature over a course of 10 
minutes; 

(Note: Step 2 is optional for single-stranded 
template, and may be omitted at appropriate.) 

3. 1.0/xl of HiFi Bst/HiFi Bst-II (lu//U) and 
l.O/il of [e*- 32 P] dATP was added, and the mixture was 
mixed gently and spun for 2-3 seconds; 

4. 4 tubes "A", "C" , "G", "T" were labelled, 
and 2jil of each premixed nucleotide solution and 
2.5/zl of main mixture {from step 3) was added to 
the respective reaction tube; 

5. The tubes were incubated at 65°C for 2 
minutes; 
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6. 2 . 0/il of 0 . 5mM dNTPs was added to each 
tube, and the tubes were mixed gently, spun for 2- 
3 seconds, and incubated at 65 °C for 2 minutes; 

7. The reactions were stopped by adding 4 . 0/il 
5 of Stop Solution (95% deionized formamide, lOmM 

EDTA, 0.05% xylene cyanol FF, 0.05% bromophenol 
blue) to each tube; 



8. The samples were denatured at 90°C for 2 
minutes, and immediately placed on ice; 



10 9. 2-3/il of the samples were loaded onto each 

lane of 6% (8M urea) sequencing gel, and 
electrophoresis was carried out. 



Note: The Components of the Premixed Nucleotide 
Solutions for HiFi Bst : 

15 A mix: dATP0.62/iM, dCTP 62/zM, dGTP 62/iM, dTTP 

62/iM, ddATP 25/iM; 

C mix: dATP 0.8/iM, dCTP 8/iM, dGTP 80/iM, dTTP 
8 0/iM, ddCTP 50/iM; 

G mix: dATP 0.8/iM, dCTP 80/xM, dGTP 4/iM, dTTP 
20 80/iM, ddGTP 75/zM; 

T mix: dATP 0.8/iM, dCTP 80/iM, dGTP 80/iM, dTTP 
8/iM, ddTTP 150/iM. 

The Components of the Premixed Nucleotide 
Solutions for HiFi Bst-II: 

25 A mix: dATP 0.8/iM, dCTP 80/iM, dGTP 80/iM, dTTP 

80/iM, ddATP 25/iM; 

C mix: dATP 0.8/iM, dCTP 8/iM, dGTP 80/iM, dTTP 
80/iM, ddCTP 20/iM; 

G mix: dATP 0.8/iM, dCTP 80/iM, dGTP 8/iM, dTTP 
30 80/iM, ddGTP 50/zM; 

T mix: dATP 0.8/iM, dCTP 80/iM, dGTP 80/iM, dTTP 
8/iM, ddTTP 50/iM. 
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Figure 3 shows the comparison of 
radiolabeled DNA sequencing with HiFi Bst and HiFi 
Bst-II in their respective optimized reaction 
mixtures. The bands on a DNA sequencing gel with 
HiFi Bst-II were more uniform than those with HiFi 
Bst. The sequence pattern using HiFi Bst-II was 
better than that using HiFi Bst. This made the gel 
with HiFi Bst-II even easier to read. Moreover, 
the ratio of dideoxy- nucleotide concentration to 
deoxy- nucleotide concentration in Premixed 
Nucleotide Solutions for HiFi Bst-II was lower 
than that for HiFi Bst. For instance, the 
ddATP/dATP, ddCTP/dCTP, ddGTP/dGTP and ddTTP/dTTP 
ratios were 40, 6.25, 18.75 and 18.75, 
respectively, for HiFi Bst. The corresponding 
ratios for HiFi Bst-II were 40, 2.5, 6.25 and 
6.25, respectively. Therefore, the concentrations 
of ddNTPs used in the termination reaction are 
reduced to about three- folds after HiFi Bst has 
been genetically modified. 

Example 9: DNA sequencing using unmodified Bst 
DNA polymerase/modif ied Bst DNA polymerase with 
dye -primers for single- or denatured double - 
stranded DNA template 

The following procedure was carried out. 

1. The following was combined: 4.0^1 DNA 
template (300-600ng for ssDNA or 1-2/zg denatured 
ds DNA) with 5.0/il SxReaction Buffer. The mixture 
was mixed and spun for 2-3 seconds in a 
microcentrifuge ; 

2. 4 tubes were labeled "A", "C", "G" , "T" 
and the pre -mixed dATP, dCTP, dGTP, dTTP reagents 
were added to each reaction tube: 

Reaction tube 
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A C G T 

A -REG primer (0.2uM) 1 . Oul 
C-FAM primer (0.2uM) 1 . Oul 

G-TMR primer (0.4uM) 1 . Oul 

T-ROX primer (0.4uM) l.Oul 
A terminator mix 2 . Oul 

C terminator mix 2 . Oul 

G terminator mix 2 . Oul 

T terminator mix 2 . Oul 

5 x Reaction buffer 

with DNA template 2 . Oul 2 . Oul 2 . Oul 2 . Oul 

Total volume 5 . Oul 5 . Oul 5 . Oul 5 . Oul 

3. The tubes were placed in 75 °C bath for 5 
minutes, and allowed to cool slowly to ambient 
temperature over the course of 10 minutes; 
{Note: Step 3 is optional for single-stranded 
template, and may be omitted as appropriate.) 

4. 1/il of HiFi Bst/HiFi Bst-II (0.5u//xl) was 
added to each tube, and the tubes were spun for 2- 
3 seconds ; 

5. The tubes were incubated at 65 °C for 5 
minutes ; 

6 . The contents of the tt A" , "C" , "G" and W T" 
tubes were pooled, and 1.5/zl of 7 . 5M ammonium 
acetate and 55^1 of ethanol was added. The 
mixtured was mixed in a vortex and then placed on 
ice for 20 minutes; 

7. The mixture was centrifuged at 12,500rpm 
for 2 0 minutes at 4°C; 

8. The supernatant was drawn off, and the 
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pellet was washed with 200/U of 70% ethanol; 

9. The pellet was vacuum dried for 2-3 
minutes, and resuspended in 4/zl of loading buffer 
{5:1 deionized formamide : 25mM EDTA with 50mg/ml 
Blue Dextran) ; 

10. The sample was heated at 75 °C for 2-3 
minutes, and immediately placed on ice; 

11. 2-3 /il of sample was loaded onto a lane 
of the 4% (6M urea) sequencing gel, and ABI PRISM™ 
377 DNA Sequencer (from Perkin Elmer) was used to 
collect data. 

Note: Dye primer: DYEnamic Energy Transfer Dye 
Primers (from Amersham) : 

-21 M13 forward: 5 ' -FAM-S T SSSSSTGT*AAAACGACGGCCAGT- 
3' (SEQ ID NO: 11) 
TS=1 ' 2 ' -dideoxyribose 

T*=T attached with Dye 2 (A- REG, C-FAM, G-TMR, T- 
ROX) 

Figure 4 and Figure 5 show the results 
of dye-primer DNA sequencing with HiFi Bst and 
HiFi Bst-II. Both DNA polymerases generated 
similar sequencing results although the peaks on 
the color plot by HiFi Bst II appear to be more 
even in height . 

Example 10: DNA sequencing using unmodified Bst 
DNA polymerase /modified Bst DNA polymerase with 
dye -terminators for single- or denatured double - 
stranded DNA template 

The following procedure was carried out. 

1. The following components were combined in 
a labeled microcentrifuge tube: 
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SxReaction Buffer 



4 . Ofil 

8.0/il (2-3/ig ss DNA 
or denatured 



Template 



ds DNA) 



Primer 



2.0/il (5-10ng) 



10 



15 



20 



25 



The final volume was 14/il. The contents were 
mixed and spun for 2-3 seconds; 

2. The tube was placed in a 75°C water bath 
for 5 minutes ; 

3 . The tube was allowed to cool slowly to 
ambient temperature over a course of 10 minutes ; 
(Note: Steps 2 and 3 are optional for single- 
stranded template, and may be omitted as 
appropriate . ) 

4. 1.0/zl of HiFi Bst/HiFi Bst-II (l-2u/^l) , 
5/il of nucleotides premix (containing Perkin 
Elmer-ABI fluorescent dye-labeled nucleotide 
terminators) , were added and the tube was spun for 
2-3 seconds ,- 

5. The mixture was incubated at 65°C for 10 
minutes ; 

6. 80/il of H 2 0 was added to the reaction mix, 
and the dye terminators were extracted with lOO/il 
of phenol :H 2 0: chloroform (68:18:14) reagent twice. 
The sample was vortexed and centrifuged, and the 
aqueous upper layer was transferred to a clean 



7. To the tube was added 15^1 of 2M sodium 
acetate, pH 4.5, and 3 00/il of ethanol , and the 
tube was vortexed and placed in ice-water bath for 
2 0 minutes; 



tube ; 
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8. The tube was centrifuged with 12,500rpm 
for 2 0 minutes at 4°C; 

. .9. The supernatant was drawn off, and the 
pellet was washed with 200jul of 70% ethanol; 

10. The pellet was vacuum dried for 2-3 
minutes, and resuspended in 4/j1 of loading buffer 
(5:1 deionized formamide : 25mM EDTA with 50mg/ml 
Blue Dextran) ; 

11. The sample was heated at 90 °C for 2-3 
minutes, and immediately placed on ice; 

12. 2-3/il of sample was loaded onto a lane of 
the 4% (6M urea) sequencing gel, and ABI PRISM™ 
377 DNA Sequencer (from Perkin Elmer) was employed 
to collect data, using appropriate amounts of 
nucleotide pre-mixed reagents. 

Figure 6 and Figure 7 show the results of 
dye- terminator DNA sequencing with HiFi Bst and 
HiFi Bst-II. There was data lost in dye- terminator 
DNA sequencing with HiFi Bst, especially the "C" 
after M G" or W A" and "A" after "G" . In Figure 8, 
corrections of the missing or ambiguous bases, 
according to the known pGEM sequence, have been 
indicated below the letters "N" or below the 
incorrect base letters. This problem caused 
ambiguity in DNA sequencing. But it was resolved 
in dye -terminator DNA sequencing with the modified 
Bst DNA polymerase of this invention. 
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WHAT IS CLAIMED IS : 

1. A modified DNA polymerase which 
during DNA sequencing effectively incorporates 
fluorescent dye -labeled dideoxynucleotide 
terminators ddCTP, ddATP, ddTTP and ddGTP, and 
their analogs, and reduces selective 
discrimination against incorporation of 
fluorescent dye- labeled dideoxynucleotide 
terminators ddCTP and ddATP, 

wherein the DNA polymerase in its unmodified 
state selectively discriminates against 
incorporation of fluorescent dye- labeled 
dideoxynucleotide terminators ddCTP and ddATP but 
does not discriminate against incorporation of 
fluorescent dye- labeled dideoxynucleotide 
terminators ddTTP and ddGTP. 

2 . The DNA polymerase according to 
claim 1 which is a modified Bacillus 

s tearo thermophilics DNA polymerase. 

3 . The DNA polymerase according to 
claim 2 which has an amino acid sequence that 
shares not less than 95% homology of a DNA 
polymerase isolated from a strain of Bacillus 
s tearo the rmophi lu s , Bacillus caldotenax or 
Bacillus caldolyticus . 

4 . The DNA polymerase according to 
claim 1 wherein the DNA polymerase is a modified 
DNA polymerase obtained from a mesophilic 
bacterium. 

5 . The DNA polymerase according to 
claim 1 which is a thermostable DNA polymerase 
having proofreading 3' -5' exonuclease activity 
during DNA sequencing of a DNA strand from a 
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template, such that the DNA polymerase functions 
to excise mismatched nucleotides from the 3' 
terminus of the DNA strand at a faster rate than 
the rate at which the DNA polymerase functions to 
remove nucleotides matched correctly with 
nucleotides of the template. 

6. A modified Bacillus 
stearothexwophilus DNA polymerase which during DNA 
sequencing effectively incorporates fluorescent 
dye- labeled dideoxynucleotide terminators ddCTP, 
ddATP, ddTTP and ddGTP, and reduces selective 
discrimination against incorporation of 
fluorescent dye -labeled dideoxynucleotide 
terminators ddCTP and ddATP, 

wherein the DNA polymerase in its unmodified 
state selectively discriminates against 
incorporation of fluorescent dye- labeled 
dideoxynucleotide terminators ddCTP and ddATP but 
does not discriminate against incorporation of 
fluorescent dye -labeled dideoxynucleotide 
terminators ddTTP and ddGTP, 

wherein the DNA polymerase has proofreading 
3' -5' exonuclease activity during DNA sequencing 
of a DNA strand from a template, such that the DNA 
polymerase functions to excise mismatched 
nucleotides from the 3' terminus of the DNA strand 
at a faster rate than the rate at which the DNA 
polymerase functions to remove nucleotides matched 
correctly with nucleotides of the template. 

1. The DNA polymerase according to 
claim 6, which has the amino acid sequence SEQ ID 
NO: 4 . 

8 . The DNA polymerase according to 
claim 6, which is encoded by a DNA segment having 
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the nucleotide sequence of SEQ ID: NO 3. 

9. A host cell which produces a 
modified DNA polymerase which during DNA 
sequencing effectively incorporates fluorescent 
dye-labeled dideoxynucleotide terminators ddCTP, 
ddATP, ddTTP and ddGTP, and reduces selective 
discrimination against incorporation of 
fluorescent dye -labeled dideoxynucleotide 
terminators ddCTP and ddATP, 

wherein the DNA polymerase in its unmodified 
state selectively discriminates against 
incorporation of fluorescent dye- labeled 
dideoxynucleotide terminators ddCTP and ddATP but 
does not discriminate against incorporation of 
fluorescent dye- labeled dideoxynucleotide 
terminators ddTTP and ddGTP. 

10. The host cell according to claim 9, 
wherein the modified DNA polymerase has 
proofreading 3' -5' exonuclease activity during DNA 
sequencing of a DNA strand from a template, such 
that the DNA polymerase functions to excise 
mismatched nucleotides from the 3' terminus of the 
DNA strand at a faster rate than the rate at which 
the DNA polymerase functions to remove nucleotides 
matched correctly with nucleotides of the 
template. 

11. The host cell according to claim 9, 
which produces a DNA polymerase having the amino 
acid sequence SEQ ID NO: 4. 

12. The host cell according to claim 9, 
wherein the DNA polymerase is encoded by the 
nucleotide sequence SEQ ID NO:3. 
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13. The host cell of claim 9, which is 

E. coli . 

14 . A method of sequencing a DNA strand 
comprising the steps of: 

i) hybridizing a primer to a DNA template to 
be sequenced; 

ii) extending the primer using the modified 
DNA polymerase of claim 1, in the presence of 
adequate amounts of nucleotide bases dATP, dGTP, 
dCTP and dTTP , or their analogs, and fluorescent 
dye -labeled dideoxynucleotide terminators, 

under such conditions that the DNA strand is 
sequenced. 

15. The method according to claim 14, 
wherein the DNA polymerase is a modified Bacillus 
stearo thermophilics DNA polymerase. 

16. The method according to claim 14, 
wherein the modified DNA polymerase has an amino 
acid sequence that shares not less than 95% 
homology of a DNA polymerase isolated from a 
strain of Bacillus stearothermophilus , Bacillus 
caldotenax or Bacillus caldolyticus . 

17. The method according to claim 14, 
wherein the modified DNA polymerase has the amino 
acid sequence SEQ ID NO:4. 

18. The method according to claim 14, 
wherein the DNA polymerase is encoded by the 
nucleotide sequence SEQ ID NO: 3. 



19. The method according to claim 14, 
wherein the DNA polymerase is a modified DNA 
polymerase obtained from a mesophilic bacterium. 
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20. The method according to claim 14, 
wherein the DNA polymerase is a thermostable DNA 
polymerase having proofreading 3' -5' exonuclease 
activity during DNA sequencing of a DNA strand 
from a template, such that the DNA polymerase 
functions to excise mismatched nucleotides from 
the 3' terminus of the DNA strand at a faster rate 
than the rate at which the DNA polymerase 
functions to remove nucleotides matched correctly 
with nucleotides of the template. 

21. The method according to claim 20, 
wherein the DNA polymerase is a modified Bacillus 
stearothermophilus , Bacillus caldotenax or 
Bacillus caldolyticus DNA polymerase. 

22. The method according to claim 21, 
wherein the DNA polymerase has the amino acid 
sequence of SEQ ID N0:4. 

23. The method according to claim 21, 
wherein the DNA polymerase is encoded by a DNA 
segment having the nucleotide sequence of SEQ 
ID:N0 3. 

24 . A method for producing a modified 
form of a DNA polymerase which during DNA 
sequencing selectively discriminates against 
incorporation of fluorescent dye -labeled 
dideoxynucleotide terminators ddCTP and ddATP but 
does not discriminate against incorporation of 
fluorescent dye -labeled dideoxynucleotide 
terminators ddTTP and ddGTP, comprising the step 
of 

modifying a DNA polymerase which has an amino 
acid sequence that shares not less than 95% 
homology of a DNA polymerase isolated from a 
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strain of Bacillus stearothexmophilus , Bacillus 
caldotenax or Bacillus caldolyticus , so that the 
modified DNA polymerase includes threonine, 
proline and leucine at positions 342-344, 
respectively, and tyrosine at position 422. 

25. The method according to claim 24, 
wherein the DNA polymerase has proofreading 3' -5' 
exonuclease activity during DNA sequencing of a 
DNA strand from a template, such that the DNA 
polymerase functions to excise mismatched 
nucleotides from the 3' terminus of the DNA strand 
at a faster rate than the rate at which the DNA 
polymerase functions to remove nucleotides matched 
correctly with nucleotides of the template. 

26 . The method according to claim 24 , 
wherein the DNA polymerase has the amino acid 
sequence of SEQ ID: NO 4. 

27. The method according to claim 24, 
wherein the DNA polymerase is encoded by a DNA 
segment having the nucleotide sequence of SEQ 
ID:N0 3 . 

28. A method for producing a modified 
form of a DNA polymerase which during DNA 
sequencing selectively discriminates against 
incorporation of fluorescent dye- labeled 
dideoxynucleotide terminators ddCTP and ddATP but 
does not discriminate against incorporation of 
fluorescent dye -labeled dideoxynucleotide 
terminators ddTTP and ddGTP / comprising the step 
of 

modifying a nucleotide sequence encoding a 
DNA polymerase which has an amino acid sequence 
that shares not less than 95% homology of a DNA 
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polymerase isolated from a strain of Bacillus 
stearo thermophilics or Bacillus caldotenax, so that 
the modified nucleotide sequence encodes 
threonine, proline and leucine residues at 
positions 342-344, respectively, and a tyrosine 
residue at position 422. 

29. A DNA construct comprising: 

(i) a nucleotide sequence encoding a DNA 
polymerase which has an amino acid sequence that 
shares not less than 95% homology of a DNA 
polymerase isolated from a strain of Bacillus 
stearothermophilus or Bacillus caldotenax, which 
nucleotide sequence encodes threonine, proline and 
leucine residues at positions 342-344, 
respectively, and a tyrosine residue at position 
422; and 

(ii) a vector, for introducing the DNA 
construct into euaryotic or procaryotic host 
cells . 

30. The DNA construct according to 
claim 29 wherein the vector is a cloning vector or 
an expression vector. 

31. A kit for direct DNA sequencing 
comprising the modified DNA polymerase of claim 1 
and at least one fluorescent dye- labeled ddNTP. 

32. The kit according to claim 31 
wherein the modified DNA polymerase is a modified 
Bacillus stearothermophilus, Bacillus caldotenax 
or Bacillus caldolyticus DNA polymerase. 

33. The kit according to claim 31 
wherein the modified DNA polymerase has an amino 
acid sequence that shares not less than 95% 
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homology of a DNA polymerase isolated from a 
strain of Bacillus stearothermophilus , Bacillus 
caldotenax or Bacillus caldolyticus . 

34. The kit according to claim 31 
wherein the modified DNA polymerase is modified 
DNA polymerase obtained from a mesophilic 
bacterium. 

35. The kit according to claim 31 
wherein the modified DNA polymerase has 
proofreading 3' -5' exonuclease activity during DNA 
sequencing of a DNA strand from a template, such 
that the DNA polymerase functions to excise 
mismatched nucleotides from the 3' terminus of the 
DNA strand at a faster rate than the rate at which 
the DNA polymerase functions to remove nucleotides 
matched correctly with nucleotides of the 
template . 

36. The kit according to claim 31, 
wherein the modified DNA polymerase is modified 
Bacillus stearotheirmophilus DNA polymerase which 
has proofreading 3' -5' exonuclease activity during 
DNA sequencing of a DNA strand from a template, 
such that the DNA polymerase functions to excise 
mismatched nucleotides from the 3' terminus of the 
DNA strand at a faster rate than the rate at which 
the DNA polymerase functions to remove nucleotides 
matched correctly with nucleotides of the 
template . 

37. The kit according to claim 31 
wherein the modified DNA polymerase has the amino 
acid sequence SEQ ID NO: 4. 

38. The kit according to claim 31 
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wherein the modified DNA polymerase is encoded by 
a DNA segment having the nucleotide sequence of 
SEQ ID: NO 3. 

39. A modified nucleotide sequence 
encoding a DNA polymerase which has an amino acid 
sequence that shares not less than 95% homology of 
a DNA polymerase isolated from a strain of 
Bacillus stearo thermophilics or Bacillus 
caldotenax, which nucleotide sequence encodes 
threonine, proline and leucine residues at 
positions 342-344, respectively, and a tyrosine 
residue at position 422. 

40. The modified nucleotide sequence 
according to claim 39 which has the nucleotide 
sequence of SEQ ID: NO 3. 

41. A method of dye-labeled primer 
sequencing of a DNA strand comprising the steps 
of: 

i) hybridizing a fluorescent dye- labeled 
primer to a DNA template to be sequenced; 

ii) extending the primer using the modified 
DNA polymerase of claim 1, in the presence of 
adequate amounts of nucleotide bases dATP, dGTP, 
dCTP and dTTP, or their analogs, and 
dideoxynucleotide terminators, 

under such conditions that the DNA template 
is sequenced. 

42. The method according to claim 41, 
wherein the DNA polymerase is a modified Bacillus 
stearothexwophilus DNA polymerase. 

43. The method according to claim 41, 
wherein the modified DNA polymerase has an amino 
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acid sequence that shares not less than 95% 
homology of a DNA polymerase isolated from a 
strain of Bacillus stearothermophilus , Bacillus 
caldqtenax or Bacillus caldolyticus . 

44. The method according to claim 41, 
wherein the modified DNA polymerase has the amino 
acid sequence SEQ ID NO: 4. 

45. The method according to claim 41, 
wherein the DNA polymerase is encoded by the 
nucleotide sequence SEQ ID NO: 3. 

46. The method according to claim 41, 
wherein the DNA polymerase is a modified DNA 
polymerase obtained from a mesophilic bacterium. 

47. The method according to claim 41, 
wherein the DNA polymerase is a thermostable DNA 
polymerase having proofreading 3 '-5' exonuclease 
activity during DNA sequencing of a DNA strand 
from a template, such that the DNA polymerase 
functions to excise mismatched nucleotides from 
the 3 ' terminus of the DNA strand at a faster rate 
than the rate at which the DNA polymerase 
functions to remove nucleotides matched correctly 
with nucleotides of the template. 

48. The method according to claim 47, 
wherein the DNA polymerase is a modified Bacillus 
stearothermophilus , Bacillus caldotenax or 
Bacillus caldolyticus DNA polymerase. 

49. A method of radioisotope-labeled 
sequencing of a DNA strand comprising the steps 
of: 

i) providing four separate tubes, each 
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containing DNA template to be sequenced, a primer, 
dATP , dTTP, dCTP and dGTP, wherein one of dATP, 
dTTP, dCTP or dGTP is labeled with a radioisotope, 
and DNA polymerase according r.o claim 1 , 

wherein each tube contains an appropriate 
amount of one of ddATP, ddTTP, ddCTP or ddGTP; 

ii) allowing DNA sequencing of the DNA 
template to be carried out ; and 

iii) determining the sequence of the DNA 
template . 

50. The method according to claim 49, 
wherein the DNA polymerase is a modified Bacillus 
stearo thermophilics DNA polymerase. 

51. The method according to claim 49, 
wherein the modified DNA polymerase has an amino 
acid sequence that shares not less than 95% 
homology of a DNA polymerase isolated from a 
strain of Bacillus stearothermophilus , Bacillus 
caldotenax or Bacillus caldolyticus . 

52. The method according to claim 49, 
wherein the modified DNA polymerase has the amino 
acid sequence SEQ ID NO: 4. 

53. The method according to claim 49, 
wherein the DNA polymerase is encoded by the 
nucleotide sequence SEQ ID NO: 3. 

54. The method according to claim 49, 
wherein the DNA polymerase is a modified DNA 
polymerase obtained from a mesophilic bacterium. 

55. The method according to claim 49, 
wherein the DNA polymerase is a thermostable DNA 
polymerase having proofreading 3' -5' exonuclease 
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activity during DNA sequencing of a DNA strand 
from a template, such that the DNA polymerase 
functions to excise mismatched nucleotides from 
the 3/ terminus of the DNA strand at a faster rate 
than the rate at which the DNA polymerase 
functions to remove nucleotides matched correctly 
with nucleotides of the template. 

56. The method according to claim 55, 
wherein the DNA polymerase is a modified Bacillus 
stearo thermophilics , Bacillus caldotenax or 
Bacillus caldolyticus DNA polymerase. 
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A Model 377 
jg^ Version 2.1.1 

PRISM 



OP-ss Signal C:467 A:227 G;201 T:184 p age A of , 

pGEM-3Zf(+) ET{-21m13fwd}960909 Tue, 20. 1 997 8:43 

ssDNA ET-377 Sample Matrix Mon, 19. 1997 21:09 

Lane 4 Points 1000 to 9000 Base 1: 1000 Spacing: 9.71 ABU 00 



' i\c ~. a " aqqqcqaa ;o~g;.;gg ^jjgggqa 



agac-oa.:- q;agg;a giaao;- gm» *:.::».Mi o::;:: a»* *o:, oqio aa.ca.cc :a:*o: 



"TCCTQTOrOAAAf rOT TArcCQCTCACAAf 'CCACACA *C A TACQAGCCaaAAOCAT A AAQ .O: AAAQ£C I'dOGG .'CCC7AA • QAQ: QAOCTAAC fCACAi * AA f TOCQT 




^Q'^QCTCQGm'CQT'TCGQCTQCGQDGAQCGGT' A TCAGCTGACTCAAAGQ-GG7AA TACGGTTATCCACAOVA!"CAGG<lG\TA ACQGAGOaA AG^ACA fOTGAGCAAAAGQCCAGCAAAAGGCCAG 
, ,-r* uo Kfl J GO 410 l 1 " *M *50 4«0 4T0 4M 
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SEQUENCE LISTING 



<110> HONG, GUOFAN 



HUANG, WEI-HUA 



<12 0> DNA POLYMERASE HAVING ABILITY TO REDUCE INNATE 



SELECTIVE DISCRIMINATION AGAINST FLUORESCENT 



DYE - LABELED D I DEOX YNUCLEOT I DES 



<130> hongsequencelisting 



<140> 09/157,397 



<141> 1998-09-21 
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<150> 08/544,643 



<151> 1995-10-18 



<150> 08/642,684 



<151> 1996-05-03 



<160> 11 



<170> Patentln Ver. 2.0 - beta 



<210> 1 



<211> 1764 



<212> DNA 



<213> Bacillus stearothermophilus 
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gccgaagggg agaaaccgct tgaggagatg gagtctgcca tcgttgacgt cattaccgaa 60 



gagacgcccg ccgacaaggc agcgcttgtc gttgaggtga tggaagaaaa ctaccacgat 120 



gccccgattg ccggaatcgc actagtgaac gagcatgggc gattttttat gcgcccggag 180 



accgcgctgg ctgattcgca atccttagca tggcttgccg atgaaacgaa gaaaaaaagc 240 



atgtttgacg ccaagcgggc agtcgttgcc ttaaagtgga aaggaattga gcttcgcggc 300 



gtcgcctttg at ttactgct cgctgcctat ttgctcaatc cggctcaaga tgccggcgat 360 



atcgctgcgg cggcgaaaat gaaacaatat gaagcggtgc ggccggatga agcggcctat 4 20 



ggcaaaggcg tcaagcggtc gctgccggac gaacagacgc ttgctgagca tctcgt tcgc 4 80 



aaagcggcag ccatttgggc gcttgagcag ccgtttatgg acgatttgcg gaacaacgaa 54 0 



caagatcaac tattaacgaa gcttgagcac gcgctggcgg cgatttcggc tgaaacggaa 600 



ttcactgggg tgaacgtgga tacaaagcgg cttgaacaga tgggttcgga gctcgccgaa 660 



caactgcgtg ccatcgagca gcgcatttac gagccagccg gccaagagtt caacactaac 720 
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tcaccaaaac agctcggagt cactttattt gaaaagctgc agctaccggt gccgaagaag 780 



acgaaaacag gctattcgac tccggctgat gtgcttgaga agcttgcgcc gcatcatgaa 84 0 



accgtcgaaa acattttgca ttaccgccag cttggcaaac cgcaaccaac gcacattgaa 900 



ggattgttga aagttgtgcg ccctgatacc ggcaaagtgc atacgatgtt caaccaagcg 960 



ctgacgcaaa ctgggcggct cagctcggcc gagccgaact tgcaaaacat tccgactcgg 1020 



ctcgaagagg ggcggaaaat ccgccaagcg ttcgtcccgt cagagccgga ccggcccatt 1080 



ttcgccgccg accactcaca aattgaattg cgcgtzcctcg cccatatcgc cgatgacgac 1140 



aatctaattg aagcgttcca acgcgatttg gatattcaca caaaaacggc gatggacatc 1200 



ttccagttga gcgaagagga agtcacggcc aacatgcgcc gccaggcaaa ggccgccaac 1260 



tccggtatcg tttacggaat tagcgattac ggattggcgc aaaacttgaa cattacgcgc 1320 



aaagaagctg ccgaattcat cgaacgttac ttcgccagct ttccgggcgt aaagcagtat 1380 



atggaaaaca tagtgcaaga agcgaaacag aaaggatatg tgacaacgct gttgcatcgg 144 0 



cgccgccatc tgcctgatat tacaagccgc aatttcaacg tccgcagttzt cgcagagcgg 1500 



acggccatga acacgccaat tcaaggaagc gccgctgaca ctattaaaaa agcgacgatt 1560 
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gatttagcgg cacggctgaa agaagagcag ctccaggctc gtcttttgct gcaagtgcat 1620 



gacgagctca ctttggaagc gccaaaagag gaaatcgagc gattatgtga gcttgttccg 1680 



gaagcgatgg agcaggccgt tacgccccgc gtgccgctga aagccgacca ccactacggc 1740 



ccaacatggt atgatgccaa ataa 1764 



<210> 2 



<211> 587 



<212> PRT 



<213> Bacillus stearothermophi lus 



<400> 2 



Ala Glu Gly Glu Lys Pro Leu Glu Glu Met Glu Phe Ala lie Val Asp 



IS 10 15 
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Val lie Thr Glu Glu Met Leu Ala Asp Lys Ala Ala Leu Val Val Glu 



20 25 30 



Val Met Glu Glu Asn Tyr His Asp Ala Pro lie Val Gly He Ala Leu 
35 40 45 



Val Asn Glu His Gly Arg Phe Phe Met Arg Pre Glu Thr Ala Leu Ala 
50 55 €0 



Asp Ser Gin Phe Leu Ala Trp Leu Ala Asp Glu Thr Lys Lys Lys Ser 
65 70 75 80 



Met Phe Asp Ala Lys Arg Ala Val Val Ala Leu Lys Trp Lys Gly lie 
85 90 95 
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Glu Leu Arg Gly Val Ala Phe Asp Leu Leu Leu Ala Ala Tyr Leu Leu 



105 110 



Asn Pro Ala Gin Asp Ala Gly Asp lie Ala Ala Val Ala Lys Mec Lys 



120 125 



Gin Tyr Glu Ala Val Arg Ser Asp Glu Ala Val Tyr Gly Lys Gly Val 



130 135 140 



Lys Arg Ser Leu Pro Asp Glu Gin Thr Leu Ala Glu His Leu Val Arg 



145 150 155 160 



Lys Ala Ala Ala lie Trp Ala Leu Glu Gin Pro Phe Met Asp Asp Leu 
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165 170 175 



Arg Asn Asn Glu Gin Asp Gin Leu Leu Thr Lys Leu Glu His Ala Leu 



180 185 190 



Ala Ala He Leu Ala Glu Met Glu Phe Thr Gly Val Asn Val Asp Thr 



195 200 205 



Lys Arg Leu Glu Gin Met Gly Ser Glu Leu Ala Glu Gin Leu Arg Ala 



210 215 220 



He Glu Gin Arg He Tyr Glu Leu Ala Gly Gin Glu Phe Asn lie Asn 



225 230 235 240 
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Ser Pro Lys Gin Leu Gly Val He 



245 



Val Leu Lys Lys Thr Lys Thr Gly 



260 



Glu Lys Leu Ala Pro His His Glu 
275 280 



Arg Gin Leu Gly Lys Leu Gin Ser 
290 295 



Val Val Arg Pro Asp Thr Gly Lys 
305 310 



Leu Phe Glu Lys Leu Gin Leu Pro 
250 255 

Tyr Ser Thr Ser Ala Asp Val Leu 
265 270 

He Val Glu Asn lie Leu His Tyr 
285 

Thr Tyr He Glu Gly Leu Leu Lys 
300 

Val His Thr Met Phe Asn Gin Ala 
315 320 
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Leu Thr Gin Thr Gly Arg Leu Ser 



325 



lie Pro lie Arg Leu Glu Glu Gly 



310 



Pro Ser Glu Pro Asp Trp Leu lie 
355 360 



Glu Leu Arg Val Leu Ala His He 
370 375 



Ala Phe Gin Arg Asp Leu Asp lie 



Ser Ala Glu Pro Asn Leu Gin Asn 
330 335 



Arg Lys He Arg Gin Ala Phe Val 
345 350 



Phe Ala Ala Asp Tyr Ser Gin He 
365 



Ala Asp Asp Asp Asn Leu He Glu 
380 



His Thr Lys Thr Ala Met Asp He 
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385 390 395 400 



Phe Gin Leu Ser Glu Glu Glu Val Thr Ala Asn Met Arg Arg Gin Ala 



405 410 415 



Lys Ala Val Asn Phe Gly lie Val Tyr Gly lie Ser Asp Tyr Gly Leu 



420 425 430 



Ala Gin Asn Leu Asn lie Thr Arg Lys Glu Ala Ala Glu Phe lie Glu 



435 440 445 



Arg Tyr Phe Ala Ser Phe Pro Gly Val Lys Gin Tyr Met Glu Asr. lie 



450 455 460 
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Val Gin Glu Ala Lys Gin Lys Gly Tyr Val Thr Thr Leu Leu His Arg 



465 170 475 480 



Arg Arg Tyr Leu Pro Asp lie Thr Ser Arg Asp. Phe Asn Val Arg Ser 



485 490 495 



Phe Ala Glu Arg Thr Ala Met Asn Thr Pro lie Gin Gly Ser Ala Ala 



500 505 510 



Asp lie lie Lys Lys Ala Met lie Asp Leu Ala Ala Arg Leu Lys Glu 



515 520 525 



Glu Gin Leu Gin Ala Arg Leu Leu Leu Gin Val His Asp Glu Leu lie 



530 535 540 
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Leu Glu Ala Pro Lys Glu Glu He Glu Arg Leu Cys Glu Leu Val Pro 
545 550 555 560 

Glu Val Met Glu Gin Ala Val Thr Leu Arg Val Pro Leu Lys Val Asp 
565 570 575 

Tyr His Tyr Gly Pro Thr Trp Tyr Asp Ala Lys 
580 585 
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<213> Bacillus s tearothermophilus 



<400> 3 



atggccgaag gggagaaacc gcttgaggag acggagtntg ccatcgtcga cgtcattacc 60 



gaagagatgc ttgccgacaa ggcagcgctc gtcgttgagg tgatggaaga aaactaccac 120 



gatgccccga ttgtcggaat cgcactagtg aacgagcatg ggcgatttt t natgcgcccg 180 



gagaccgcgc tggctgattc gcaactttta gcatggcttg ccgatgaaac gaagaaaaaa 240 



agcatgtttg acgccaagcg ggcagtcgcc gccttaaagt ggaaaggaac tgagcttcgc 300 



ggcgtcgcct ccgacttact gctcgctgcc tatttgctca atccggctca agatgccggc 360 



gatatcgctg cggcggcgaa aatgaaacaa tatgaagcgg tgcggtcgga tgaagcggtc 420 



tacggcaaag gcgtcaagcg gccgctgccg gacgaacaga cgcttgctga gcatctcgcc 4 80 



cgcaaagcgg cagccatttg ggcgcttgag cagccgttta tggacgattt gcggaacaac 54 0 



gaacaagatc aattattaac gaagcttgag cacgcgctgg cggcgacttt ggctgaaacg 600 



gaattcactg gggtgaacgc ggatacaaag cggcttgaac agatgggtcc ggagcccgcc 660 
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gaacaactgc gtgccatcga gcagcgcact tacgagctag ccggccaaga gttcaacatt 720 
aactcaccaa aacagctcgg agtcatttta cttgaaaagc tgcagctacc ggtgctgaag 780 
aagacgaaaa caggctattc gacttcggct gatgtgcttg agaagcttgc gccgcatcat 840 
gaaatcgtcg aaaacattct gcattaccgc cagcttggca aactgcaatc aacgnatatt 900 
gaaggattgt tgaaagttgt gcgccctgat accggcaaag tgcatacgat gtccaaccaa 960 
gcgctgacgc aaactgggcg gctcagctcg gccgagccga acttgcaaaa cattccgact 1020 
cggaccccac tggggcggaa aatccgccaa gcgttcgtcc cgtcagagcc ggactggctc L080 
atcttcgccg ccgaccactc acaaattgaa ctgcgcgtcc tcgcccatat cgccgatgac 1140 
gacaatctaa ttgaagcgtt ccaacgcgat ttggatatnc acacaaaaac ggcgatggac 1200 
attttccagt tgagcgaaga ggaagccacg gccaacatgc gccgccaggc aaaggccgtt 1260 
aactacggta ccgtttacgg aattagcgat tacggaccgg cgcaaaactt gaacattacg 1320 
cgcaaagaag ccgccgaatt tatcgaacgt tacttcgcca gctttccggg cgcaaagcag 1380 
tatacggaaa acatagtgca agaagcgaaa cagaaaggat atgtgacaac gctgttgcat 1440 
cggcgccgcc atttgcctga tatnacaagc cgcaatttca acgtccgcag ttttgcagag 1500 
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cggacggcca tgaacacgcc aacccaagga agcgccgctg acattatcaa aaaagcgatg 1560 



attgacctag cggcacggct gaaagaagag cagcttcagg ctcgtct ttt gctgcaagtg 1620 



catgacgagc tcattttgga agcgccaaaa gaggaaattg agcgattatg tgagcttgtt 1680 



ccggaagcga cggagcaggc cgttacgctc cgcgtgccgc tgaaagtcga ccaccattac 174 0 



ggcccaacat ggtatgatgc caaa 



<211> 588 



<212> PRT 



<213> Bacillus stearothermophilus 



<400> 4 

Met Ala Glu Gly Glu Lys Pro Leu Glu Glu Met Glu Phe Ala lie Val 
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Asp Val He Thr Glu Glu Met Leu Ala 
20 25 



Glu Val Met Glu Glu Asn Tyr His Asp 
35 40 



Leu Val Asn Glu His Gly Arg Phe Phe 
50 55 



Ala Asp Ser Gin Phe Leu Ala Trp Leu 
65 70 
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Asp Lys Ala Ala Leu Val Val 



30 



Ala Pro lie Val Gly lie Ala 



45 



Met Arg Pro Glu Thr Ala Leu 



60 



Ala Asp Glu Thr Lys Lys Lys 



75 80 



Ser Met Phe Asp Ala Lys Arg Ala Val Val Ala Leu Lys Trp Lys Gly 
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85 90 95 

lie Glu Leu Arg Gly Val Ala Phe Asp Leu Leu Leu Ala Ala Tyr Leu 
100 105 HO 

Leu Asn Pro Ala Gin Asp Ala Gly Asp lie Ala Ala Val Ala Lys Mec 
115 120 125 

Lys Gin Tyr Glu Ala Val Arg Ser Asp Glu Ala Val Tyr Gly Lys Gly 
130 135 140 

Val Lys Arg Ser Leu Pro Asp Glu Gin Thr Leu Ala Glu His Leu Val 
145 150 155 160 



18 
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Arg Lys Ala Ala Ala He Trp Ala 
165 

Leu Arg Asn Asn Glu Gin Asp Gin 
180 

Leu Ala Ala He Leu Ala Glu Met 
195 200 

Thr Lys Arg Leu Glu Gin Met Gly 
210 215 

Ala He Glu Gin Arg He Tyr Glu 
225 230 
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Leu Glu Gin Pro Phe Met Asp Asp 



170 175 



Leu Leu Thr Lys Leu Glu His Ala 



185 190 



Glu Phe Thr Gly Val Asn Val Asp 



205 



Ser Glu Leu Ala Glu Gin Leu Arg 



220 



Leu Ala Gly Gin Glu Phe Asn lie 



235 240 
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Asn Ser Pro Lys Gin Leu Gly Val lie Leu Phe Glu Lys Leu Gin Leu 
245 250 255 



Pro Val Leu Lys Lys Thr Lys Thr Gly Tyr Ser Thr Ser Ala Asp Val 
260 265 270 



Leu Glu Lys Leu Ala Pro His His Glu He Val Glu Asn lie Leu His 
275 280 285 



Tyr Arg Gin Leu Gly Lys Leu Gin Ser Thr Tyr lie Glu Gly Leu Leu 



290 295 300 



Lys Val Val Arg Pro Asp Thr Gly Lys Val His Thr Met Phe Asn Gin 



20 
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305 310 



Ala Leu Thr Gin Thr Gly Arg Leu 



325 



Asn He Pro lie Arg Thr Pro Leu 



340 



Val Pro Ser Glu Pro Asp Trp Leu 
355 360 



He Glu Leu Arg Val Leu Ala His 
370 375 
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315 320 



Ser Ser Ala Glu Pro Asn Leu Gin 
330 335 



Gly Arg Lys He Arg Gin Ala Phe 
345 350 



He Phe Ala Ala Asp Tyr Ser Gin 
365 



He Ala Asp Asp Asp Asn Leu He 
380 
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Glu Ala Phe Gin Arg Asp Leu Asp lie 



385 390 



lie Phe Gin Leu Ser Glu Glu Glu Val 



405 



Ala Lys Ala Val Asn Tyr Gly lie Val 



420 425 



Leu Ala Gin Asn Leu Asn He Thr Arg 



435 440 



Glu Arg Tyr Phe Ala Ser Phe Pro Gly 



450 455 
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His Thr Lys Thr Ala Met Asp 
395 400 

Thr Ala Asn Met Arg Arg Gin 
410 415 

Tyr Gly He Ser Asp Tyr Gly 
430 

Lys Glu Ala Ala Glu Phe lie 
445 

Val Lys Gin Tyr Me: Glu Asn 
460 
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He Val Gin Glu Ala Lys Gin Lys Gly Tyr Val Thr Thr Leu Leu His 
465 470 475 480 



Arg Arg Arg Tyr Leu Pro Asp He Thr Ser Arg Asn Phe Asn Val Arg 
485 490 495 



Ser Phe Ala Glu Arg Thr Ala Met Asn Thr Pro He Gin Gly Ser Ala 
500 505 510 



Ala Asp lie He Lys Lys Ala Met He Asp Leu Ala Ala Arg Leu Lys 
515 520 525 



Glu Glu Gin Leu Gin Ala Arg Leu Leu Leu Gin Val His Asp Glu Leu 
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530 535 540 



lie Leu Glu Ala Pro Lys Glu Glu lie Glu Arg Leu Cys Glu Leu Val 



545 550 555 560 



Pro Glu Val Met Glu Gin Ala Val Thr Leu Arg Val Pro Leu Lys Val 



565 570 575 



Asp Tyr His Tyr Gly Pro Thr Trp Tyr Asp Ala Lys 



580 585 



<210> 5 



<211> 17 
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<212> DNA 



<213> Bacillus stearothermophi lus 



<400> 5 



cattttgctg ccggtca 17 



<211> 20 



<212> DNA 



<213> Bacillus stearochermophi lus 



<400> 6 



gtaaaacgac ggccagtctt 20 
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<210> 7 



<211> 20 



<212> DNA 



<213> Bacillus stearothermophi lus 



gcaaaacgac ggccagtcgg 2 0 



<210> 8 



c211> 20 



<212> DNA 



c213> Bacillus stearothermophi lus 



<400> 8 
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cattttgctg ccggtcagaa 20 



<210> 9 



<211> 26 



<212> DNA 



<213> Bacillus stearothermophi lus 



<400> 9 



gccgttaact acggtatcgt tcacgg 



<210> 10 



<211> 36 



<212> DNA 



<213> Bacillus stearothermophilus 
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<400> 10 



cattccgatt cggaccccac tggggrggaa aatccg 



<210> 11 



<211> 24 



<212> DNA 



<213> Bacillus stearochermothilus 



<400> 11 



sssssstgta aaacgacggc cagt 
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